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ABSTRACT 

The Kilo-Degree Survey (KiDS) is a multi-band imaging survey designed for cosmo¬ 
logical studies from weak lensing and photometric redshifts. It uses the ESO VLT 
Survey Telescope with its wide-held camera OmegaCAM. KiDS images are taken in 
four hlters similar to the SDSS ugri bands. The best-seeing time is reserved for deep 
r-band observations. The median 5-cr limiting AB magnitude is 24.9 and the median 
seeing is below 0.7". 

Initial KiDS observations have concentrated on the GAMA regions near the celestial 
equator, where extensive, highly complete redshift catalogues are available. A total 
of 109 survey tiles, one square degree each, form the basis of the Hrst set of lensing 
analyses of halo properties of GAMA galaxies. 9 galaxies per square arcminute enter 
the lensing analysis, for an effective inverse shear variance of 69 per square arcminute. 
Accounting for the shape measurement weight, the median redshift of the sources is 
0.53. 

KiDS data processing follows two parallel tracks, one optimized for weak lensing mea¬ 
surement and one for accurate matched-aperture photometry (for photometric red- 
shifts). This technical paper describes the lensing and photometric redshift measure¬ 
ments (including a detailed description of the Gaussian Aperture and Photometry 
pipeline), summarizes the data quality, and presents extensive tests for systematic 
errors that might affect the lensing analyses. We also provide first demonstrations of 
the suitability of the data for cosmological measurements, and describe our blinding 
procedure for preventing confirmation bias in the scientific analyses. 

The KiDS catalogues presented in this paper are released to the community through 
http://kids.strw.leidenuniv.nl. 

Key words: cosmology: observations - gravitational lensing: weak - galaxies: pho¬ 
tometry - surveys 
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1 INTRODUCTION 

Measurements of the gravitational fields around galaxies 
have for many decades provided firm evidence for ‘dark mat¬ 
ter’: galaxies attract their constituent stars, each other and 
their surroundings more strongly than can reasonably be 
estimated on the basis of their visible contents (for a histor¬ 
ical account of the subject see Sanders 2014). Furthermore, 
observations of the temperature anisotropies of the cosmic 
backgronnd radiation show that most of this dark matter 
cannot be baryonic (Planck Collaboration et al. 2015a), in 
agreement with constraints from Big Bang Nucleosynthe¬ 
sis models (e.g.. Fields & Olive 2006). Understanding the 
distribution of matter in the Universe is therefore a funda¬ 
mental task of observational cosmology. The cold dark mat¬ 
ter model, augmented with increasingly sophisticated galaxy 
formation recipes, has been very successful in describing, 
and reproducing the detailed statistical properties of, the 
large-scale distribution of galaxies. Though important issues 
remain, the ACDM model is the baseline for interpreting 
galaxy formation. 

A central role in testing galaxy formation and cosmol¬ 
ogy models is played by observational mass measurements. 
They provided the first evidence for dark matter as mass dis¬ 
crepancies in galaxies (e.g., Bosma 1978; Rubin et al. 1978; 
Faber & Gallagher 1979; van Albada et al. 1985; Fabricant 
et al. 1980; Buote & Canizares 1994) and clusters (Zwicky 
1937). Mass measurements also serve to establish the link 
between the observed galaxies and their dark haloes, whose 
assembly age and clustering is mass-dependent, as is well de¬ 
scribed by the halo model (Cooray & Sheth 2002). Masses 
can be obtained from internal and relative kinematics of 
galaxies and their satellites, from X-ray observations of hy¬ 
drostatic hot gaseous haloes around galaxies and clusters, 
and from strong and weak gravitational leasing. 

While kinematics and X-ray mass determinations usu¬ 
ally require assumptions of steady-state dynamical equi¬ 
librium, gravitational leasing directly probes the projected 
mass distribution. This model-independent aspect of leas¬ 
ing is very powerful, but comes at a price. Strong leasing 
measurements are rare and depend on suitable image con¬ 
figurations and mass distributions. These result in complex 
selection effects which it is essential to understand (Bland- 
ford & Kochanek 1987). Weak leasing, on the other hand, 
is intrinsically noisy and thus requires stacking many lenses, 
except for the most massive galaxy clusters (Tyson et al. 
1984). Over the past two decades, telescopes equipped with 
larger and larger CCD cameras have provided the means to 
make wide-area weak leasing studies possible: most recently 
from the CFHTLenS analysis of the CFHT Legacy Survey 
(Heymans et al. 2012, henceforth H-l-12), which targeted 
galaxies (see for example Coupon et al. 2015), groups and 
clusters (see for example Ford et al. 2015) and the large-scale 
structure (see for example Fu et al. 2014). 

This paper introduces the first lensing results from a 
new, large-scale multi-band imaging survey, the Kilo-Degree 
Survey (KiDS). Like the on-going Dark Energy Survey (for 
first lensing results from DES science verification data see 
Melchior et al. 2015; Vikram et al. 2015) and the Hyper- 
SuprimeCam survey (for first lensing results from HSC see 
Miyazaki et al. 2015), KiDS aims to exploit the evolution of 
the density of clustered matter on large scales as a cosmolog¬ 


ical probe (Albrecht et al. 2006; Peacock et al. 2006), as well 
as to study the distribution of dark matter around galaxies 
with more accuracy than has been possible thus far from 
the ground (e.g., Mandelbaum et al. 2006; van Uitert et al. 
2011; Velander et al. 2014) or space (e.g., Leauthaud et al. 
2012). Unlike DES and HSC, which use large allocations of 
time on 4- and 8-m facility telescopes respectively, KiDS 
uses a dedicated 2.6-m wide-field imaging telescope, specif¬ 
ically designed for exquisite seeing-limited image quality. It 
is also unique in that all its survey area overlaps with a deep 
near-infrared survey, VIKING (Edge et al. 2013), providing 
extensive information on the spectral energy distribution of 
galaxies. 

In de Jong et al. (2015, henceforth dej-|-15) we present 
the public data release of the first KiDS images and cata¬ 
logues. Here we describe the aspects of the survey, data qual¬ 
ity and analysis techniques that are particularly relevant for 
the weak lensing and photometric redshift measurements, 
and introduce the resulting shape catalogues. Accompany¬ 
ing papers present measurements and analyses of the mass 
distribution around galaxy groups (Viola et al. 2015), galax¬ 
ies (van Uitert et ah, in preparation), and satellites (Sifon 
et al. 2015). 

This paper is organized as follows. §2 presents the sur¬ 
vey outline and data quality, as well as the data reduc¬ 
tion procedures leading up to images and catalogues. §3 
describes how the lensing measurements are made, §4 dis¬ 
cusses the photometry pipeline and the derived photomet¬ 
ric redshifts, and in §5 a number of tests for systematic er¬ 
rors in the data reduction are presented. Having demon¬ 
strated that the KiDS data deliver high-fidelity lensing 
measurements, in §6 we calculate the cosmic shear signal 
from this first instalment of KiDS imaging. Our conclu¬ 
sions are summarized in §7. In three appendices we give 
the mathematical detail of the PSF homogenization and 
matched-aperture photometry “GAaP” pipeline, illustrate 
some of the quality control plots that are used in the sur¬ 
vey production and validation, and provide a guide to the 
source catalogues which are publicly available to download 
at http://kids.strw.leidenuniv.nl. 


2 DESCRIPTION OF SURVEY AND DATA 
QUALITY 

KiDS (de Jong et al. 2013) is a cosmological, multi-band 
imaging survey designed for weak lensing tomography. It 
uses the VLT Survey Telescope (VST) on the European 
Southern Observatory’s Paranal observatory. The VST is 
an active-optics 2.6-m modified Ritchey-Chretien telescope 
on an alt-az mount, with a 2-lens field corrector and a sin¬ 
gle instrument at its Cassegrain focus: the 300-megapixel 
OmegaCAM CCD mosaic imager. The 32 CCDs that make 
up the ‘science array’ are 4102 x 2048-pixel e2v 44—82 de¬ 
vices, which sample the focal plane at a very uniform scale 
of 0.213” per 15-micron pixel. The chips are 3-edge but- 
table, and are mounted close together with small gaps of 25- 
85”. OmegaCAM has thinned CCDs, which avoids some of 
the problems inherent in deep depletion devices such as the 
‘brighter-fatter’ effect that introduces non-linearity into the 
extraction of PSF shapes from the images (Melchior et al. 
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Figure 1. Example of high-quality KiDS data obtained with VST/OmegaCAM. PSF SExtractor parameters shown are for the stacked 
r-band image of tile KIDS_132.0_-0.5. Left: direction and strength of the ellipticities of stars in the field. Right: PSF ellipticity (top) and 
FWHM size (bottom) vs. distance from the centre of the image. 


2015; Niemi et al. 2015, see also §3.2.3), or the ‘tree rings’ 
(Plazas et al. 2014). 

In order to maintain good image quality over the large 
held of view OmegaCAM makes use of wavefront sensing. 
For this purpose two auxiliary CCDs are mounted on the 
outskirts of the focal plane, vertically displaced ±2mm with 
respect to the science array. As a result, the star images 
registered on these chips are signihcantly out of focus and 
their shapes and sizes provide the information required to 
monitor and optimise the optical set-up in real-time. Auto- 
guiding of both tracking and held rotation is done using two 
further (in-focus) auxiliary CCDs. For more details on VST 
and OmegaCAM see Capaccioli et al. (2012) and Kuijken 
(2011) and references therein. 

The integrated optical design of the telescope and cam¬ 
era makes for uniquely uniform and high-quality images over 
the full one-square degree held of view, well-matched to the 
seeing conditions on Paranal. An example ‘best-case’ point 
spread function (PSF) measured from a co-added stack of 
hve dithered sub-exposures is shown in Fig. 1, demonstrat¬ 
ing that the system is able to deliver better than 0.6” seeing 
over the full held even in long exposures with low-level el¬ 
lipticity distortion. This benign PSF variation can be mod¬ 
elled well and leads to very low residuals in the galaxy el¬ 
lipticity measurements, (see §3 below). Furthermore, since 
there are no instrument changes on the VST the system is 
mostly stable, and continuously monitored photometrically. 
For a discussion on the long-term photometric stability of 
VST/OmegaCAM see Verdoes Kleijn et al. (2013). 


KiDS is part of a suite of three ESO Public Imaging 
Surveys, which are queue-scheduled together on the VST 
and observed as conditions and visibility allow (Arnaboldi 
et al. 2013). The VPHAS-I- survey (Drew et al. 2014) tar¬ 
gets the Southern Galactic plane with short exposures in 
broad bands and Ha, and the ATLAS project (Shanks et al. 
2015) covers some 5000 square degrees of extra-galactic sky 
in the Southern Galactic Cap to similar depth as the (mostly 
Northern) Sloan Digital Sky Survey (Aim et al. 2014). KiDS, 
by contrast, aims to survey a 1500 square degree area to con¬ 
siderably greater depth, with the specific goal of measuring 
weak gravitational lensing masses for galaxies, groups and 
clusters as well as the power spectrum of the matter distri¬ 
bution on large scales. 

KiDS targets two ~ 10-degree wide strips on the sky: 
an equatorial strip between Right Ascension 10^20™ and 
15^50"* plus the GAMA GOO field between 08^30™ and 
09^30"“, and a Southern strip through the South Galactic 
pole between 22^00™' and 03^30”* (see deJ-M5 for the foot¬ 
print of the survey). It makes use of four broad-band in¬ 
terference Liters, ugri, with bandpasses very similar to the 
SDSS Liters described in Fukugita et al. (1996). The obser¬ 
vations of a particular KiDS tile in any given Liter consist of 
Lve dithered sub-exposures (four in the case of the u band), 
and are taken in immediate succession. This choice means 
that KiDS is not well suited for the study of variable stars or 
supernovae, but it does mean that all data for each tile/Liter 
combination are taken in very similar observing conditions, 
resulting in homogeneous data. The prevailing seeing and 
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Table 1. Observing parameters for the KiDS survey. The longer 
r-band observations are made in the best seeing conditions and 
are used for galaxy shape measurements, while the remaining 
bands are used to measure photometric redshifts. Ranges cover 
> 95 percent of the data. 


Filter 

Exposure 
time (sec) 

Dithers 

Seeing 

(arcsec) 

Limiting 

Magnitude 

Moon 

u 

960 

4 

0.95 ±0.2 

24.2 ±0.2 

dark 

9 

900 

5 

0.8 ±0.2 

25.1 ± 0.2 

dark 

r 

1800 

5 

0.7 ± 0.2 

24.9 ±0.25 

dark 

i 

1080 

5 

0.8 ±0.3 

23.7 ± 0.7 

bright 


sky brightness dictate which observation is scheduled. The 
seeing limits for the different filters are matched to the long¬ 
term Paranal average, such that the deep, best-seeing r-band 
observations can proceed at the same rate as the shallower 
u and g exposures. We summarize the observing parameters 
in Table 1. 

The first weak lensing results from KiDS are based on 
the first two public data releases (deJ-|-15), comprising the 
first 148 square degrees that were observed in all four fil¬ 
ters. 109 square degrees from this data set overlap with the 
unique GAMA spectroscopic galaxy survey (Driver et al. 
2011; Baldry et al. 2014), and this provides the focus of the 
early lensing science analyses. 

A detailed discussion of the data quality can be found 
in dej-|-15; in Table 1 we summarize the key quality indi¬ 
cators of PSF sizes and limiting magnitudes. The PSF size 
distributions reflect that the best dark time is reserved for r, 
with g and u receiving progressively worse seeing time. The 
seeing distribution of the i-band, which is the only filter used 
in bright time, is very broad. Limiting AB magnitudes (cal¬ 
culated as 5 (t in a 2" aperture) in g and r are typically ~25, 
with u significantly shallower. For i band observations, the 
large variation in seeing and sky brightness results in a wider 
variation in limiting magnitude than in the other bands. 

PSF ellipticity is of critical importance for weak lensing 
studies. Tile-by-tile statistics of the mean and standard de¬ 
viation of the PSF ellipticities^ are presented in Fig. 2, and 
show a typical mean ellipticity of 0.055 and scatter 0.035. 
Ellipticities do sometimes vary significantly over the field of 
view, due to focus or alignment errors of the optical system. 
When such errors arise, the most common ellipticity pat¬ 
terns encountered are an increase in ellipticity either in the 
centre or towards the corners of the field, and an increase in 
ellipticity towards one edge. Examples of such PSF elliptic¬ 
ity patterns are illustrated in Fig. 3. 

The KiDS data processing pipeline for lensing builds 
upon the pipeline developed for the CFHTLenS project 
(H-l-12). CFHTLenS reanalysed data from the 154-square 
degree CFHTLS-Wide survey (see for example Fu et al. 
2008), the largest deep cosmological lensing survey com¬ 
pleted to date. It is based on new methods for measuring 
galaxy colours for photometric redshifts, and for obtain- 


^ Note that in this section PSF ellipticity is defined as (1 — q) 
where q = 6/a is the minor-to-major axis ratio of the star images; 
this differs from the lensing definition used later on in this paper. 
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Figure 2. Distribution of mean ellipticities and standard devia¬ 
tions of ellipticities of co-added images in data releases 1 and 2 
of KiDS. The values are based on SExtractor ellipticity mea¬ 
surements of the 500 brightest unsaturated stars in each tile. The 
grey scale indicates the number of survey tiles in each bin. Top: 
r band only; Bottom: data from all filters. 


ing ellipticities, the crucial ingredient for weak lensing. Our 
KiDS analysis uses further refinements of these techniques. 

For historical and practical reasons, KiDS uses different 
data reduction pipelines for the lensing shape measurements 
and for the photometry. The latter is based on the 4-band 
co-added images that are released for general-purpose sci¬ 
ence through the ESO science archive, while the former uses 
a lensing-optimised processing pipeline of the r-band data 
only. Integration of both these pipelines and workflows into 
a single process is underway. Meanwhile, we have taken ad¬ 
vantage of the redundancy to perform cross-checks between 
the different pipelines, for example on star-galaxy separa¬ 
tion, masking and photometric calibration, where possible. 

Weak lensing measurements are intrinsically noise- 
dominated; results therefore rely on ensemble averaging so 
that even small systematic residual shape errors can propa¬ 
gate into the final result and overwhelm the statistical power 
of the survey. For this reason our dedicated shape measure¬ 
ment pipeline (see §3) avoids stacking sub-exposures and re¬ 
sampling of the image pixels. Instead it relies on combining 
the likelihoods of shape parameters from the different sub¬ 
exposures of each source. This part of the reduction was 
performed only on the r-band data, with image calibration 
and processing using the Theli pipeline (Schirmer 2013; Er- 
ben et al. 2013, henceforth E-bl3), and object detection and 
classification, PSF modelling and shape measurements us¬ 
ing the lens&t code (Miller et al. 2013, henceforth M-l-13). 
Before distribution to the team for scientific analysis, the 
shape measurements were ‘sabotaged’ through a blinding 
procedure described in §6.1. 

The multi-colour photometry was performed tile by tile 
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Figure 3. PSF ellipticity patterns caused by a non-optimal op¬ 
tical configuration of the telescope. The curved focal plane of the 
VST translates any primary mirror astigmatism into increased 
ellipticity in the centre of the field (top). A tilt of the secondary 
mirror results in increased ellipticity near one edge of the field 
(bottom panel). 

on stacked images for each of the four bands. This part of 
the reduction made use of the Astro- WISE environment 
(Begeman et al. 2013) and optical reduction pipeline (Mc¬ 
Farland et al. 2013). These multi-band images are released 
to the ESO archive as part of the second KiDS data release, 
as described in deJ-|-15. The lensing-quality reduction of the 
r-band imaging is made available on request. 


3 KiDS GALAXY SHAPES FOR LENSING 

As the lensing data processing of KiDS is built upon the 
pipeline developed for CFHTLenS, we refer the reader to the 


CFHTLenS technical papers (H-l-12; M+13; E-l-13) for de¬ 
tailed descriptions of the lensG-t and Theli implementation. 
In this section we highlight the differences and improvements 
implemented for this first KiDS lensing analysis. 

3.1 Lensing-quality Theli r-band data reduction 

Our reduction of OmegaCAM data starts from raw data 
provided by the ESO archive. Most of the processing algo¬ 
rithms used are similar to those initially developed for the 
wide-held imager on the ESO 2.2-m telescope at La Silla, as 
described in Erben et al. (2005). A more in-depth descrip¬ 
tion with tests on the Theli data products will be published 
in Erben et al. (in preparation). 

The Theli processing consists of the following steps: 

(i) The basis for all Theli processing is formed by all 
publicly available OmegaCAM data at the time of process¬ 
ing. All data are retrieved from the ESO archive^. 

(ii) Science data are corrected for crosstalk effects. We 
measure significant crosstalk between CCDs #94, #95 and 
#96^ (deJ-l-15). Each pair of these three CCDs show posi¬ 
tive or negative crosstalk in both directions. We found that 
the strength of the flux transfer significantly varies on short 
time-scales and we therefore determine new crosstalk coef¬ 
ficients for each KiDS observing block (maximum duration 
ca. 1800s). 

(iii) The characterisation and removal of the instrumen¬ 
tal signature (bias, flat field, illumination correction) is per¬ 
formed simultaneously on all data from a two-week period 
around each new-moon and full-moon phase. Each two-week 
period of dark or bright time defines an OmegaCAM process¬ 
ing run (see also section 4 of Erben et al. 2005), over which 
we assume that the instrument configuration is stable. The 
processing run definition by moon phase also naturally cor¬ 
responds to the observations with different filters (u, g and 
r in dark time and i during bright time). 

(iv) Photometric zero-points, atmospheric extinction co¬ 
efficients and colour terms are estimated per complete pro¬ 
cessing run. They are obtained by calibration of all science 
observations in a run that overlap with the Data Release 10 
of the SDSS (Ahn et al. 2014). Between 30 and 150 such 
images, with good airmass coverage, are available per each 
processing run. 

(v) If necessary we correct OmegaCAM data for occa¬ 
sional electronic interference which produces coherent hori¬ 
zontal patterns over the whole field of view. 

(vi) As the last step of the run processing we subtract 
the sky from all individual chips. The resulting single-CCD 
sub-exposures, 160 per r-band tile, form the basis for the 
later shape analysis with lensGt. 

(vii) All science images belonging to a given KiDS point¬ 
ing are astrometrically calibrated against the 2MASS cata¬ 
logue (Skrutskie et al. 2006). At present we only use KiDS 
data belonging to each individual pointing for its astromet¬ 
ric calibration. A more sophisticated procedure, taking into 
account overlaps from adjacent pointing as well as data from 
the overlapping ATLAS survey (Shanks et al. 2015), will be 

^ ESO data archive: http: //archive. eso. org 
® Note that the OmegaCAM CCD’s have names ESO_CCD_#65 
to #96, see dej-|-15 for their layout in the focal plane. 
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included in the future and should constrain the astrometric 
solution further near the edges of each tile. 

(viii) The astrometrically calibrated data are co-added 
with a weighted mean algorithm. The identification of pix¬ 
els that should not contribute, for example those affected by 
cosmic rays, and weighting of usable pixels is determined as 
described in E+13. 

(ix) Finally, SExtractor (Bertin & Arnouts 1996) is 
run on the co-added image to generate the source catalogue 
for the lensing and matched-aperture photometry measure¬ 
ments. 

The final products of the Theli processing are, for each 
tile, the single-chip r-band data, the corresponding co-added 
image with associated weight map and sum image, and a 
source catalogue (see also E-l-13 for a more detailed de¬ 
scription of these products). These images are made publicly 
available on request. 

3.2 Point spread function 

Knowledge of the point spread function (PSF) is essential 
for any weak lensing analysis, since the PSF modifies galaxy 
shapes. The thousands of stars recorded in every KiDS tile 
provide samples of the PSF across the held. The hrst steps 
are to identify these stars among the many galaxies in each 
image, and to build a PSF model from them. 

3.2.1 Star selection 

High-density, spatially homogeneous and pure star cata¬ 
logues are required to construct a good PSF model across the 
held of view. We outline in this section how we classify stars 
in order to meet these requirements. We start by creating a 
source detection catalogue for each of the 5 sub-exposures 
in a KiDS held, using SExtractor with a high detection 
threshold. For each sub-exposure, and every detected object 
for which FLUX_AUTO has a signal-to-noise ratio (SNR) 
larger than 15, we then measure the second-order moments 
Qij and the axisymmetric fourth order moment J given by 

^ _ Jd^xW{x)I{x)xiXj 

Jd^xW{x)I{x) ’ ^ ^ 

^ J (fxW{x)I{x) \x\‘^ 

J A^x W{x)I(x) 

In the above equations I{x) is the surface brightness of the 
object at position x measured from the SExtractor posi¬ 
tion of the object, and W(x) is a Gaussian weighting func¬ 
tion which we employ to suppress noise at large scales. The 
width of the weighting function is fixed and we choose it to 
have a dispersion of 3 pixels, motivated by the typical seeing 
value of our r-band data (~ 0.7"). 

Defining Q — Q11 + Q22, we note that and are 
two different measures of the size of an object, and the ratio 
between these two quantities depends on the concentration 
of the object’s surface density profile. These two parameters 
therefore efficiently classify sources according to their sizes 
and luminosity profiles. 

Fig. 4 shows the distribution of detected objects as a 
function of their second and fourth order moments for the 


different sub-exposures in an example tile. We see that galax¬ 
ies are scattered over a wide range of {Q, J) values whereas 
point sources cluster in a very compact region with low 
and The width of this region depends on how strongly 
the PSF varies across the field of view. 

We identify stars in the plane by locating the 

compact over-density with a ‘friends of friends’ algorithm. 
The fixed linking length was empirically determined from a 
sample of the data. We require the final star catalogue to 
contain the largest possible number of objects while min¬ 
imising contamination by galaxies, as assessed visually by 
inspecting the stellar-locus in the (half-light radius, mag¬ 
nitude) plane, shown in the right panel of Fig. 4. In order 
to minimise the effect of the PSF variation across the held 
of view we perform this search in each individual CCD and 
sub-exposure separately. This automated method is a signif¬ 
icant improvement over the approach taken by CFHTLenS, 
where the stellar locus was visually identihed for each chip 
using data from the co-added image, for every tile in the full 
survey. 

In a hnal cleaning stage, we combine the 5 star cat¬ 
alogues for each chip and we count how many times each 
object has been classihed as a star. The hnal star catalogue 
requires that an object be classihed as a star in at least 3 
out of the 5 sub-exposures. In the cases where the object 
is not observed in all sub-exposures, for example when the 
object lands in a chip gap or at the edge of the held due 
to the dithering, we only require the star to be classihed 
as such once. In Appendix B1 on quality control. Fig. B1 
shows an example distribution of the selected stars across 
the held of view. Plots such as these are inspected for each 
held to ensure that the stellar classihcation is producing a 
spatially homogeneous catalogue. Conhrmation of the purity 
of our star catalogue comes from the PSF modelling where 
typically less than 1 percent of the objects are rejected as 
outliers at that stage. 

3.2.2 PSF modelling 

For each KiDS sub-exposure, we construct a PSF model 
that describes the position-dependent shapes of the identi¬ 
hed stars. The PSF model is expressed as a set of amplitudes 
on a 32 X 32 pixel grid, sampled at the CCD detector resolu¬ 
tion and normalized so that their sum is unity. The variation 
of each pixel value with position in the held takes the form 
of a two-dimensional polynomial of order n, with the added 
hexibility that the lowest-order coefficients are allowed to 
diher from CCD to CCD: this allows for a more complex 
spatial variation of the PSF and also, in principle, allows 
for discontinuities in the PSF between adjacent detectors. 
If the polynomial coefficients up to order ric are allowed to 
vary in this way, then the total number of model coefficients 
per pixel is 

NcoeS = 2 1 )(^ + 2) -|- (Nd — 1)(Rc + 1)(Rc + 2)] (3) 

with Ad = 32, the number of CCD detectors in Omega- 
CAM. The coefficients for each PSF pixel are htted inde¬ 
pendently and a check is made that the total PSF normal¬ 
isation is unity at the end of the htting process. The hux 
and position of each star are also allowed to be free param¬ 
eters in the ht, with the stars aligned to the pixel grid of 
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Figure 4. Automatic star-galaxy separation based on the second and fourth order moment radii and of individual sources, 

for a typical KiDS observation. Five out of the six square panels show the distributions for the individual sub-exposures, with the objects 
identified as stars shown in red. As the seeing differs between the sub-exposures, the combined distribution for the observation, in the 
sixth square panel, reveals a series of distinct stellar peaks. The right-most panel shows the distribution of these points in the traditional 
radius-magnitude plane for the co-added image. 


the PSF model using a sine function interpolation. This ap¬ 
proach allows a great deal of flexibility in the PSF model: 
in particular it does not imprint any additional basis set 
signature on top of the detector pixel basis. The total num¬ 
ber of coefficients is large, but is well constrained by the 
large number of data measurements (number of pixels times 
number of stars) in each sub-exposure. Only stars with a 
high SNR should be used for constructing the PSF model, 
because otherwise noise on the measurement of the stellar 
positions will bias the model towards larger sizes. 

In order to optimise the functional form of the PSF 
model, we selected 10 KiDS fields at random and analysed 
the five r-band sub-exposures in each field, varying the poly¬ 
nomial orders n and ric- We characterise the PSF ellipticity 
epSF and size RpsF of the pixelised model and data as 

Qii — Q22 + 2i(3i2 ... 

gpSp ^ - — (4) 

Qll + Q22 + 2 y/ Q11Q22 — Qi2 

RpSF = \j Q11Q22 — Qi2 ( 5 ) 

(cf. Eq. 1), with the weight function W{x) set to a Gaussian 
of dispersion two pixels. 

For an accurate PSF model the residuals JepsF = 
epsF (model) — epsF(data) and dRpsp = Rpgp (model) — 
J?pgF(data) should be dominated by photon noise, and 
therefore uncorrelated between neighbouring stars. Follow¬ 
ing Rowe (2010) we therefore seek to miminise the PSF el¬ 
lipticity residual auto-correlation, with as few parameters as 
necessary. This statistic can be estimated from the data as 

((5epSF5epsF)e = 3^ [<5epSF(a:a)5ep3p(a;(,)], (6) 


where the average is taken over pairs of objects for which 
\xa — Xb\ falls in a bin around angular separation d, and 5R 
and * denote the real part and complex conjugate, respec¬ 
tively. Analogously, we also measure the correlation function 
of the residual size SRpsf- 

Fig. 5 shows the residual correlation functions mea¬ 
sured at 1 aremin separation. We chose this scale as it is 
the smallest scale that can be reliably measured given the 
typical star density in the images. The data come from our 
sample of KiDS sub-exposures for six different PSF mod¬ 
els, with the full field of view polynomial order n = 3 and 
4, and chip-dependent polynomial order ric = 0 and 1. We 
also test models without any chip-dependent coefficients, de¬ 
noted ric = none (these models have atotal of |(n-|-l)(n-|-2) 
coefficients). The lower panels of Fig. 5 show the residual 
PSF size correlation as a function of the average PSF size 
Rpgp. We see a general trend, that the larger sized PSFs 
lead to more accurate modelling, which suggests that the 
impact of undersampling, when imaging the PSF, may be 
an important effect to model in the future. The upper pan¬ 
els of Fig. 5 show the residual PSF ellipticity correlation as 
a function of the average PSF ellipticity |epsF|. Unsurpris¬ 
ingly, more elliptical PSFs lead to less accurate modelling. 

Comparing the results from the different models, we 
find a reduction in the residuals with the inclusion of a chip- 
dependent component to the PSF modelling, favouring ric = 
1. With that choice, we find little difference between the 
n = 3 and n = 4 model, selecting the n = 3 PSF model, as 
is has the lowest number of parameters for the two options. 
With n = 3 and ric = 1 we fit NcoeS ~ 103 parameters per 
model PSF pixel. (With several thousand stars per tile, this 
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Figure 5. Selecting the optimal fitting orders for the PSF model for a sample of representative KiDS observations. The upper panels 
show the residual PSF ellipticity correlation, measured at 1 arcmin separation, as a function of the average PSF ellipticity within the 
sub-exposure. The lower panels show the two-point residual PSF size correlation measured at 1 arcmin separation as a function of the 
average PSF size f?p3p. Each data point represents a different sub-exposure, with the point style indicating the polynomial orders (n, ric) 
of the model. Left: results for n = 3; right: results for n = 4. 


large number of parameters can still be determined reliably 
from the data.) 

Analysing the full KiDS data set with this PSF 
model, we find residual correlation functions in the range 
((5i?pgF(5i?PSp)0=i/ = (3.5 ± 1.3) X 10“^arcsec‘*, and ^ 

((5epsF5epsF)e=i' = (7.1 ± 3.5) x 10“®. The size residual 
correlation remains fairly constant as a function of angular ^ 

separation, whereas the amplitude of the ellipticity resid- 
ual correlation decreases with increasing separation, becom¬ 
ing consistent with zero for scales 0 > 20'. The angular 
dependence of the PSF ellipticity correlation function and 
the residuals are shown for an example KiDS field in Ap¬ 
pendix B2. Even though we find persistent PSF residual 
correlations, they are too small to impact our scientific anal¬ 
yses of the data. For example, Rowe (2010) define a re¬ 
quirement on the systematic PSF ellipticity residual with 
correlation amplitude (5£psF(5epsF)s=i' < 5 x 10“®, such 
that it contributes to less than 5 percent of the ACDM cos¬ 
mic shear lensing signal for source galaxies at z ~ 0.5. At 
larger separations the requirement is more stringent with 
(5epSF5epsF)s=io' < 8 X 10“® but, as seen in Fig. B2, the 
KiDS residual correlation functions are already consistent 
with zero on these scales. With the present analysis we there¬ 
fore easily meet the Rowe (2010) target requirement on PSF 
ellipticity residuals for the full KiDS data set. 

PSF modelling software development, currently under¬ 
going testing for future data analysis, allows for the central 
region of the pixel basis PSF model to be oversampled by 
a factor 3. Rather than re-centering each star’s data to its 
best fit position, the fitting proceeds by shifting the model to 
the best-fit data position for each star. These developments 
improve the sampling of the core of the PSF and avoid the 
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Figure 6. The average residual PSF size as a function of the 
magnitude of the star, showing no significant flux-dependence in 
the PSF size. The average non-zero residual (shown as a dashed 
line) is too low to introduce any significant bias in our analysis. 

introduction of correlated noise caused by interpolation of 
the star data in the re-centering process. The disadvantage 
of this procedure is that the model pixel values become cor¬ 
related, requiring a joint fit of a large number of parameters, 
which is computationally expensive. 

3.2.3 Testing PSF flux dependence 

Melchior et al. (2015) report a significant flux dependence 
in the PSF size in Dark Energy Survey data. The effect is 
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due to the use of modern deep-depletion CCDs in DECam 
(Antilogus et al. 2014), and is not expected to affect the 
thinned OmegaCAM detectors used for KiDS. This is in¬ 
deed the case. Fig. 6 shows the difference between the PSF 
model size and the star size, averaged over the full KiDS 
data set, as a function of the star’s magnitude. As the PSF 
model has no flux-dependence by definition, any detected 
flux dependence in the size offset between model and data 
would arise from CCD effects. Only a very slight trend with 
star magnitude is seen, more than an order of magnitude 
smaller than the effect seen by Melchior et al. (2015). The 
origin of the average non-zero residual of (3.3 ± 0.3) x 10“® 
is unclear: most likely it arises from the presence of noise 
in the size measurement of the data, in comparison to the 
measurement on the noise-free model, or from not including 
the effects of undersampling in the PSF modelling. We con¬ 
clude that PSF flux-dependence will not be a challenge for 
the KiDS analysis. 


3.3 Shape measurement with lensRt 


Weak gravitational lensing induces a coherent distortion in 
the images of distant galaxies, which we parametrize through 
the observed complex galaxy ellipticity e = ei -|- ie 2 . For a 
galaxy that is a perfect ellipse, the ellipticity parameters are 
related to the axial ratio q and orientation (j> as 


e = ei -I- ie2 = 



(7) 


Central to any weak lensing study is a data analysis tool that 
can determine galaxy shapes from imaging data. We use the 
lensfit code^ (Miller et al. 2007; Kitching et al. 2008; M-l-13) 
which performs a seven-parameter galaxy model fit (x, y po¬ 
sition, flux, scale length ra, bulge-to-disc ratio and ellipticity 
£ 1 , 2 ), simultaneously to all sub-exposures of a given galaxy, 
taking into account the different PSFs in each sub-exposure 
and the astrometric solution for each CCD. 

Lens&t first performs an analytic marginalization over 
the galaxy model’s centroid, flux and bulge fraction, us¬ 
ing the priors from M-(-13. It then numerically marginalizes 
the resulting joint likelihood distribution L{e,r^) over scale 
length, incorporating a magnitude-dependent prior derived 
from high-resolution Hubble Space Telescope (HST) imag¬ 
ing. Finally, for each galaxy a mean likelihood estimate of 
the ellipticity and an estimated inverse variance weight is 
derived, as described by M-l-13. We will refer to this latter 
quantity as the Sensing weight’. 

The KiDS lensing data are obtained in the r band. We 
therefore change the lensQt scale-length prior with respect 
to the i-band based prior used in the CFHTLenS analy¬ 
sis. For this purpose we repeat the M-l-13 analysis of the 
Simard et al. (2002) catalogue of morphological parameters. 
This catalogue is based on GALFIT galaxy profile fitting 
(Peng et al. 2010) of HST imaging data, and provides disc 
and bulge parameters in various wavebands including the 
F606W filter which is a good match to the KiDS r band. 
Selecting galaxies with 18.5 < reoe < 25.5, we find the fol¬ 
lowing relation between the median disc scale length and 



Figure 7. Left: Small and large stellar haloes, due to reflection 
from different pairs of surfaces within the VST. Right: Same im¬ 
age, displaying the two types of halo masks demarcated by small 
and large circles. The reflection halo centroids are offset from the 
star; relative to the centre of the field-of-view, the small (large) 
halo centroids lie inwards (outwards). The bright star in these 
images has an r magnitude of ~10. The large circle has a radius 
of 210". 


magnitude: 

ln(rd/arcsec) = —1.320 — 0.278(r606 — 23). (8) 

We note that the more extensive HST galaxy morphology 
analysis by Griffith et al. (2012) satisfies our requirements 
in terms of imaging depth and filter choice. However, it is 
limited to single Sersic profile fits which prevents the selec¬ 
tion of disc-dominated galaxies with which to determine a 
scale-length prior for the disc component. 

As discussed in M+13, the measurements do not 
strongly constrain the shape of the prior of ra and we there¬ 
fore adopt the same functional form (M-l-13, appendix Bl). 
For the bulge scale-length prior, the small numbers of bulge 
dominated galaxies in the Simard et al. (2002) catalogue 
prevent a robust determination. We continue to fix the half- 
light radius of the bulge component to be the exponential 
scale length of the disc component, as motivated in appendix 
A of M+13. 

The change of the galaxy size prior is the only significant 
change in lens&t as compared to the CFHTLenS analysis. 
While appropriate, its effect on the results is small: Hilde- 
brandt et al. (in preparation) present an analysis of the RC- 
SLenS survey where similar changes in the scale-length prior 
are shown not to impact the measured shear amplitudes by 
more than a few percent. 


3.4 Masking of the KiDS images 

The masking of the r-band Theli reduction uses the AU- 
TOMASK tool® to generate automated masks, which come 
in three types. ‘Void masks’ indicate regions of high spuri¬ 
ous object detection and/or a strong density gradient in the 
object density distribution (see Dietrich et al. 2007). ‘Stel¬ 
lar masks’ are generated based on standard stellar catalogs 
GSC-1 (complete at the bright end, Lasker et al. 1996) and 


See H+12 for a discussion on why lensRt is our preferred shape 
measurement method. 


® http://marvinweb.astro.uni-bonn.de/data_products/ 
THELIWWW/automask.html 
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UCAC4 (Zacharias et al. 2012, complete from r ~ 10 to 
~ 16). The stellar catalogs are used to mask the brighter 
stars as well as associated small and large reflection haloes, 
using mask radii and centroid offsets that were derived em¬ 
pirically for OmegaCAM as illustrated in Fig. 7. Finally, the 
‘asteroid masks’ flag asteroids and satellite trails. The AU- 
TOMASK algorithms and procedures are described in more 
detail in Erben et al. (2009). 

Fig. 8 shows the effect of bright stars, gronped by mag¬ 
nitude, on the neighbonring ‘sonrce galaxies’, defined here 
as objects with valid shape measurements. The upper panel 
shows the relative source number density within the large re¬ 
flection haloes as a function of the radial distance from the 
centre. The annular halo clearly results in a source count in¬ 
completeness out to ~ 200” from the halo centre, the sever¬ 
ity of which increases with stellar magnitnde. The detection 
incompleteness is essentially identical whether the source ob¬ 
jects are unweighted, or weighted using the lensGt weights, 
implying that the source density count deficiency originates 
in the object detection stage. 

The second panel of Fig. 8 shows the tangential shear 
measured by lensG-t for objects detected within the large re¬ 
flection haloes, as a function of the distance from its centre. 
In general this signal is found to be consistent with zero, 
on all scales, indicating that the local sky background sub¬ 
traction performed by lensGt removes any bias introduced 
by the haloes. The cross shear signal, not shown, is also 
consistent with zero. For the brightest stellar sources with 
r < 10.5, however, there is a ~2cr coherent tangential ellip- 
ticity detected at the halo edges at ~ 170”, and on small 
scales < 50”. For this reason we mask and remove the areas 
with reflection haloes from the scientific analyses. A sim¬ 
ilar analysis was also performed within the other, smaller 
halo seen in Fig. 7, showing identical trends in source count 
incompleteness and shape coherence. 

Based on this analysis, we define two reflection halo 
masks: a ‘conservative’ mask, with a magnitude limit at 
r = 11.5, to indicate the regions of source density incom¬ 
pleteness, and a ‘nominal’ mask that flags regions where 
there are signs of a coherent shear (r < 10.5). The stellar 
halo masks are based on both the GSC-1 and UCAC4 cata¬ 
log. 

The lower panels of Fig. 8 investigate source incom¬ 
pleteness and radial alignment of source galaxies around the 
centre of the bright stars themselves, where no sources are 
detected within 10”of the star, as these pixels are typically 
saturated. Again we see that the incompleteness and shape 
coherence depends strongly on the stellar magnitude and the 
radial dependence of this effect determines the area masked 
around each star. All stars in the UCAC4 catalog with 
r < 14.0 are masked, with masking radius (in ”) determined 
from the stellar magnitude as TZmask = 2.96r^ — 81.2r -|- 569. 
Taking an example r = 11 magnitude star, 7?mask = 34”, 
thereby masking the full area within which a significant co¬ 
herent negative tangential shear is measured. 

The automatically generated masks were visually in¬ 
spected, and additional manual masking was performed if 
necessary. A number of the early observations are affected 
by stray light from bright objects outside the field-of-view 
as a result of poor baffling of the telescope (see deJ-|-15 for 
some examples). Additionally, a number of missed asteroid 
and satellite tracks were masked manually in the co-added 
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Figure 8. The impact of bright stars on source galaxy counts 
and galaxy shapes. The upper two panels show the number count 
completeness and tangential shear measured within the large re¬ 
flection haloes as a function of the radial distance from the centre; 
the dashed vertical line indicates the 210" radius that is used to 
mask these haloes. For the very brightest haloes, a coherent tan¬ 
gential alignment of ~l-2 percent can be seen at the edges of the 
large reflection halo, and on small scales. The lower two panels 
show these same quantities as function of distance from the centre 
of bright stars. We see two effects: a decrease in the source galaxy 
counts, and a strong, coherent radial shape alignment immedi¬ 
ately around the star, which can be removed from the sample by 
applying stellar masks. 


image. Manual masking is also used to cover areas of non¬ 
uniformity which the void automask had missed, or for ad¬ 
ditional stellar halo masks in cases where the bright stellar 
catalogues are incomplete. The manual masking is then in¬ 
spected by a single person to check for uniformity. 

In total, the automated masks, using the conservative 
halo reflection scheme, along with the manual masks from 
the leasing pipeline remove 32 percent of the imaged area. 
With recent improvements at the VST to reduce scattered 
light, we anticipate the masked area fraction to reduce in 
future analyses. For this first analysis of 109 square degrees 
of KiDS data that overlap with GAMA, the total unmasked 
area is A = 75.1 square degrees. 


3.5 Effective number density of lensed galaxies 

In its current implementation, lensGt is quite conservative 
when it comes to rejecting galaxies whose isophotes might 
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be affected by neighbours. The final lensfit shape catalogue 
contains a total of 2.2 million sources with non-zero lensing 
weight, with an average number density of 8.88 galaxies per 
square arcmin over the unmasked area A of 75.1 square de¬ 
grees. While this raw number density provides information 
about the number of resolved, relatively isolated galaxies, it 
does not represent the true statistical power of the survey. 
When weights are employed in the analysis to account for 
the increased uncertainty in the galaxy shape measurements 
of smaller or fainter objects, the effective number density is 
reduced. 

Chang et al. (2013) propose an effective number density 
defined as 




2 

^SN 


'"SN + 


(9) 


where (tsn is the intrinsic ellipticity dispersion (‘shape 
noise’) and am,* is the measurement error for galaxy i. With 
this definition, rieff represents the equivalent number density 
of high SNR, intrinsic shape-noise dominated sources with 
ellipticity dispersion (tsn, that would yield a shear measure¬ 
ment of the same accuracy. 

As the lensG-t weights are designed to be an inverse vari¬ 
ance weight, w~^ ~ (tIn + with the intrinsic ellipticity 
dispersion fixed to a value usn = 0.255, we can estimate riefi 
as 

rieft ~ u'SN“ 4.48arcmin”^ . (10) 

The inverse shear variance per unit area, w, that the survey 
provides is thus equal to 

w = = 69 arcmin"^ (11) 


which corresponds to a l-cr shear uncertainty of (wA) = 
0.12/-\/]V when averaging N square arcminute of survey. 
While this definition is useful for forecasting, it makes a 
number of assumptions; that the shape noise and measure¬ 
ment noise are uncorrelated, that the estimated inverse vari¬ 
ance weight is exact, that the intrinsic ellipticity dispersion 
does not evolve with redshift and that it can be accurately 
measured from high-SNR imaging of low redshift galaxies. 

H-t-12 propose an alternative definition of an effective 
number density defined as 


Gb = 


i(E^ 


5.98arcmin 


( 12 ) 


With this definition, n*B represents the equivalent number 
density of sources with unit weight and a total ellipticity 
dispersion per component of that would yield a shear 
measurement of the same accuracy where 


2 


1 Ei 

2 E.t^? 


(13) 


For KiDS we measure Ue = 0.278 per ellipticity component, 
which is very similar to the ellipticity dispersion measured in 
CFHTLenS. This definition is useful as it makes no assump¬ 
tions about how the weight is defined. As the shot noise com¬ 
ponent for cosmic shear measurement scales with /rieff, 
the difference between these two definitions for KiDS would 
change the expected shot noise error on a cosmic shear sur¬ 
vey by ~ 10 percent. 


4 KiDS PHOTOMETRY AND PHOTOMETRIC 

REDSHIFTS 

Without good redshift estimates any weak lensing data set 
is of limited use, as redshifts are required to determine the 
critical surface density that sets the physical scale for all 
lensing-based mass measurements. For the moment, KiDS 
photometric redshifts are derived from ugri imaging, and 
are adequate for the first lensing science analyses from the 
survey (Viola et al. 2015; Sifon et al. 2015; van Uitert et al., 
in preparation). Combination with the VIKING near-IR flux 
measurements will be used to rehne the redshifts further in 
future. 

The colours of the galaxies are obtained with ‘Gaussian 
Aperture and PSF’ (GAaP) photometry, a novel technique 
that is designed to account for PSF differences between ob¬ 
servations in different filter bands while optimizing SNR. 
The procedure is summarized in §4.2 below, and described 
in detail in Appendix A. 

We base our photometric redshifts on the bpz code of 
Benftez (2000). Further details are given in §4.4 below. Al¬ 
ternative photometric redshift techniques based on machine¬ 
learning are also being investigated (Cavuoti et al. 2015), 
but have not been integrated into the lensing analysis at 
this point. 


4.1 Data reduction 

The KiDS photometric redshifts are based on the co-added 
images provided in the public data releases. The process¬ 
ing from raw pixel data to these calibrated image stacks 
is performed with a version of the Astro- WISE pipeline 
(McFarland et al. 2013) tuned for KiDS data. We refer the 
reader to deJ-|-15 for a detailed description of all the steps. 

There are some small differences between the Theli re¬ 
duction of the r-band data described in §3.1, and the four- 
band Astro- WISE processing. The latter uses a single flat 
held per hlter for the entire data set, since a dome-hat analy¬ 
sis shows that the peak-to-valley variations of the pixel sensi¬ 
tivity were less than 0.5 percent over the period during which 
the data were taken. Also, the i-band data require a de- 
fringing step, and different recipes are used to create the illu¬ 
mination correction maps (which are applied in pixel space), 
and the pixel masks that hag cosmic rays and hot/cold pix¬ 
els. Satellite track removal is automatic (currently imple¬ 
mented on a per-GGD basis). Finally, background structure 
from shadows cast by scattered light hitting the shields that 
cover the GGD bond wires are subtracted separately in a 
line-by-line background removal procedure. All images are 
visually inspected and masked if necessary before release. 

Photometric calibration starts with zero points derived 
per GGD from nightly standard held observations, tied to 
SDSS DR8 PSF magnitudes of stars (Aihara et al. 2011). 
The calibration uses a hxed aperture (6.3” diameter) not 
corrected for hux losses. Magnitudes are expressed in AB in 
the instrumental system. For g, r and i the photometry is 
homogenized across all GGDs and dithers for each survey tile 
individually. In u-band the smaller source density often pro¬ 
vides insufhcient information for this scheme. The resulting 
photometry is homogeneous within two percent per tile and 
hlter. Due to the rather fragmented distribution of observed 
tiles in the hrst two data releases, no global photometric 
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calibration over the whole survey is feasible yet, resulting in 
random offsets in the absolute zero points of the individual 
tiles thus obtained. For the GAMA tiles, which overlap with 
SDSS, we correct these offsets after the fact. Detailed analy¬ 
sis and statistics of the photometric calibration are presented 
in deJ-l-15. 

A global astrometric calibration combining all CCDs 
and dithers is calculated per filter for each tile using a second 
order polynomial. The de-trended sub-exposures are then 
re-gridded to a 0.2” pixel scale, photometrically scaled, and 
co-added to produce the image stacks. 


4.2 Gaussian aperture and PSF photometry 
(GAaP) 

Photometric redshifts of galaxies require accurate colour 
measurements. These colours do not need to describe the 
total light from the galaxy, but they should represent the 
ratio of the fluxes from the same part of the galaxy in dif¬ 
ferent filter bands. This means that we can optimize SNR 
by measuring the colours of the brighter, central regions of 
galaxies without the need to include the noise-dominated 
low surface brightness outskirts. 

Such aperture photometry is complicated by the fact 
that the PSF is not constant: it varies from sub-exposure to 
sub-exposure, with position in each image, and with wave¬ 
length. We correct for PSF variations in two steps. First, 
we homogenize the PSF within each co-added image to a 
Gaussian shape without significantly degrading the seeing. 
The resulting images contain most of the information that 
is present in the original stacks, with a simpler PSF but cor¬ 
related noise between neighbouring pixels. Second, we per¬ 
form aperture photometry using elliptical Gaussian aperture 
weight functions, and correct analytically for the seeing dif¬ 
ferences. 

In brief, the PSF Gaussianization of each KiDS tile con¬ 
sists of the following steps: 

(i) We model high-SNR stars in the co-added image with 
a shapelet expansion (Refregier 2003), using the pixel-fitting 
method described in Kuijken (2006). This formalism pro¬ 
vides a natural and mathematically convenient framework 
for PSF modelling and image convolutions. The scale radius 
(i.e., size of the parent Gaussian in the shapelet expansion) 
of the shapelets is matched to the worst seeing found in the 
individual sub-exposures making up the co-added image for 
each filter. 

(ii) We then derive a PSF map by fitting the variation of 
the shapelet coefficients across the image, using polynomials. 

(hi) We construct a grid of kernels that yield a Gaussian 
when convolved with the model PSF, also expressed in the 
shapelets formalism. The size of the ‘target’ Gaussian is set 
by the shapelet scale chosen in step (i). We fit the spatial 
variation of these kernels’ coefficients using polynomials, re¬ 
sulting in a kernel map. 

(iv) Each co-added image is convolved with its kernel 
map. 

(v) The shapes of the PSF stars on this PSF-Gaussianized 
image are modelled once again with a shapelet expansion, 
but now using a larger scale radius in order to measure 
residual flux at large radii. A map of the residual PSF non- 
Gaussianities is then made as above, and used to make a 


perturbative correction to the Gaussianized image to im¬ 
prove the PSF Gaussianity further. 

(vi) As a result of the convolution (and to a lesser extent, 
also from the preceding re-gridding before co-addition) the 
noise in these images is correlated on small scales. We keep 
track of the noise covariance matrix during the Gaussianiza¬ 
tion, and account for it in the photometric measurements. 

The GAaP photometry is performed from these PSF- 
Gaussianized, co-added images for all sources in the r-band 
THELl-fensfit catalogue. First we pick an elliptical Gaussian 
aperture for each source, with aperture size, shape and ori¬ 
entation chosen to optimize the SNR of the fluxes, based on 
the pre-Gaussianization r-band image. For major and minor 
axis lengths a and b, and orientation a with respect to the 
pixel coordinate grid, we construct an ‘aperture matrix’ 

_ ( o? cos^ a + b^ sin^ a (a^ — b^) sin a cos a \ , , 

y {a? — b^) sin a cos a c? sin^ a + b^ cos^ ® / 

which in turn is used to define the GAaP flux Fw as the 
Gaussian-weighted aperture flux of the pre-seeing image of 
the source, Ipie{x): 

Fw = J da:/pre(a:)e“2“ . (15) 


F\rj is well-defined and manifestly PSF-independent, but 
since it is defined in terms of the pre-seeing image it is a 
theoretical construct. However, it is possible to measure this 
quantity from a Gaussian-smoothed image Ig = Ipie < 8 ) G 
(where G is a Gaussian PSF of dispersion p and (8) denotes 
convolution) using the identity 


Fw 


, f da: /G(a:)e-i“^('"-^')-^“’ 
det(W — p^l) 2 J 


(16) 


which is valid for any PSF size p < a, b (i.e., as long as the 
aperture is larger than the PSF). 1 denotes the identity ma¬ 
trix. For a given source, provided the same aperture matrix 
W is used for all bands, Eq. 15 shows that this technique 
returns fluxes that weight different parts of the source con¬ 
sistently. 

A detailed description of the PSE Gaussianization 
pipeline, propagation of the noise correlation due to the con¬ 
volution, and a discussion and derivation of the GAaP flux 
formalism, may be found in Appendix A. We stress that 
these aperture magnitudes are not designed to be total mag¬ 
nitudes. 


4.3 Photometric calibration 

As described above, the photometric zero points of the co¬ 
added images used for the current analysis are calibrated 
based on nightly standard star field observations, and no 
global photometric calibration is included. To improve these 
absolute zero points, a cross-calibration to SDSS is done 
before the derivation of the photometric redshifts. 

We calibrate against the eighth data release of the SDSS 
(Aihara et al. 2011), which represents the complete SDSS 
imaging and fully overlaps with the KiDS-GAMA fields. 
Stars are selected from SDSS and matched to the KiDS 
multi-colour catalogues. We choose a magnitude range where 
the OmegaCAM sub-exposures are not saturated and SDSS 
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photometry is sufficiently precise. Over this range we aver¬ 
age the differences in the photometry between our GAaP 
measurements and the SDSS PSF magnitudes in all four 
bands (ugri). We find no trend with magnitude, confirming 
that the difference is a pure zero point offset. 

The distribution of the differences for all 114 fields is 
similar to the one shown in deJ-|-15. We find the mean off¬ 
set to be consistent with zero in the ^-band and offsets 
of ~0.02mag, ~0.05mag, and ~0.06mag in the r-, i, and 
M-bands, respectively. Field-to-field scatter is in the range 
2.5-5 percent. The offsets are applied to each field globally 
relying on the photometric stability of SDSS and the KiDS 
illumination correction. All subsequent analysis is based on 
these re-calibrated magnitudes. 

4.4 Photometric redshifts 

The KiDS photometric redshift estimates are obtained fol¬ 
lowing the methods used for CFHTLenS (Hildebrandt et al. 
2012). We use the Bayesian photometric redshift code bpz 
(Benitez 2000), a spectral template-fitting code, together 
with the re-calibrated template set by Capak (2004). 

To assess the accuracy of our photometric redshifts, 
we also produce stacks from VST data in two fields with 
deep spectroscopic coverage, the Chandra Deep Field South 
(CDFS) and the COSMOS field. These data were taken un¬ 
der the VOICE (De Cicco et al. 2015) project. Total expo¬ 
sure times in these fields are much longer than for typical 
KiDS observations, but individual sub-exposures are simi¬ 
lar to those from KiDS, allowing us to produce stacks with 
similar depth and seeing as a typical KiDS field. We extract 
catalogues and photometric redshifts in the same way as 
for the KiDS tiles, and then match the resulting photomet¬ 
ric catalogues with the combined CDFS spectroscopic cata¬ 
logue® and a deep zCOSMOS catalogue (zCOSMOS team, 
private communication). In the following we compare the 
KiDS photometric redshifts to the high-confidence spectro¬ 
scopic redshifts from these catalogues. 

Fig. 9 shows the r-band magnitude number counts of 
the lensing catalogue (weighted by the lensGt weight, see 
Sect. 3) and the spectroscopic matches (unweighted). This 
deep spectroscopic sample spans the full magnitude range 
of the lensing sample, with broadly similar distribution, and 
therefore we do not apply any further weighting. This is 
also the reason why we concentrate on the zCOSMOS and 
CDFS fields here. Adding in the numerous bright spectro¬ 
scopic redshifts from SDSS and CAMA would not add signif¬ 
icant information about the performance of the photometric 
redshifts of the faint KiDS sources. 

A straight comparison of the Bayesian photometric red¬ 
shifts, Zb, and the spectroscopic redshifts, Zspec, is shown in 
Fig. 10. To quantify the level of agreement, we characterize 
the photometric redshift of each galaxy by the relative error 

~ . (17) 

1 “t“ ^spec 

and plot its statistics in bins of magnitude and redshift in 
Figs. 11 and 12, respectively. We use the mean of Az as a 

® http://www.eso.org/sci/activities/garching/projects/ 
goods/MasterSpectroscopy.html 


0.45 



19 20 21 22 23 24 25 


r 

Figure 9. Number counts in the r band of the lensing catalogue 
(blue, weighted by lensfit weight) and the spectroscopic catalogue 
(red, unweighted). 


measure for the photometric redshift bias, the fraction of ob¬ 
jects with I Az| > 0.15 as the outlier rate, and the RMS scat¬ 
ter after rejection of the outliers as the dispersion. We show 
the statistics for different cuts on the bpz ODDS parameter 
(see Benitez 2000), which is a measure of the uni-modality of 
a galaxy’s posterior redshift distribution. Cutting on ODDS 
usually leads to slightly better photometric redshifts at the 
expense of losing objects. This is reflected in the complete¬ 
ness fraction, plotted in the bottom panel of Figs. 11 and 
12 . 

These tests check for the accuracy of the photometric 
redshift point estimates. Such point estimates can be used 
to select galaxies in certain redshift regions, to define tomo¬ 
graphic redshift bins, and to distinguish between foreground 
and background galaxies in different lensing applications. 
The modelling of the lensing measurement, however, makes 
use of the full photometric redshift posterior probability dis¬ 
tributions p(z) that BPZ estimates for each galaxy, and in 
that sense p{z) is the more crucial quantity for the weak 
lensing science goals. 

We have checked that the summed p{z) posteriors of the 
galaxies plotted in Fig. 10 agree well with their spectroscopic 
redshift distribution provided we exclude galaxies whose zb 
values lie at the extremes of the redshift distribution of the 
spectroscopic calibration sample. After some experimenta¬ 
tion, based on these results as well as on Fig. 12, we cut our 
galaxy catalogue at 0.005 < zb < 1.2 in all lensing analyses. 

Detailed characterization and testing of the p(z) will 
be presented in forthcoming papers (Choi et al. in prep., 
Hildebrandt et al. in prep.). 


4.5 Galaxy Clustering analysis 

As a further test of our photometric redshifts, following 
(Newman 2008) we calculate the angular cross-correlation 
of the positions of CAMA and KiDS galaxies on the sky, 
grouped by spectroscopic (CAMA) and photometric (KiDS) 
redshifts. Calaxies that are physically close will produce a 
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^spec 

Figure 10. Photometric redshift vs. spectroscopic redshift in the 
CDFS and COSMOS fields for objects with 19 < r < 24. Con¬ 
tours are spaced in 0.5-cr intervals with the outermost contour 
corresponding to the 2-a level. Photo-z are estimated from four- 
band ugri data from the VOICE project in the two fields, stacked 
so as to approximate the KiDS depth and seeing. Spec-z are from 
the combined ESO CDFS catalogue and a deep zCOSMOS cat¬ 
alogue. For this sample we find a photo-z scatter of 0.054 after 
rejecting 11 percent of the galaxies as outliers. The photo-z bias 
for this sample is 0.01. 

strong clustering signal, and hence this measurement can 
validate photometric redshift estimates. 

GAMA is a highly complete spectroscopic survey down 
to a limiting magnitude of r < 19.8, measuring redshifts 
out to Zspec = 0.5. We group the GAMA galaxies into five 
redshift bins i of width Azspec — 0.1. We limit the KiDS 
galaxies to r < 24, and group them into eight photomet¬ 
ric redshift bins j, listed in Fig. 13. The photometric red- 
shifts extend beyond the GAMA redshift range to zb = 1-2. 
The projected angular clustering statistic Wgg(d), between 
spectroscopic bins i and photometric bins j, is then esti¬ 
mated using the Landy & Szalay (1993) estimator by means 
of the ATHENA code (Kilbinger et al. 2014). Errors are calcu¬ 
lated using a jackknife analysis. We focus on angular scales 
1' < 0 < 30', where the upper angular scale is set by signal- 
to-noise constraints, and the lower angular scale is chosen to 
reduce the impact of scale dependent galaxy bias on the mea¬ 
surements (Schulz 2010). The results are shown in Fig. 13 
with the spectroscopic redshift bin i increasing from top to 
bottom, and the photometric redshift bin j increasing from 
left to right. 

The strongest angular clustering is found when the spec¬ 
troscopic and photometric redshift sample span the same 
redshift range, which can be seen along the ‘diagonal’ of 
Fig. 13 where i = j. This is anticipated if there is no signif¬ 
icant bias in the photometric redshift measurement zb- As 
the photometric redshifts have an associated scatter, we also 
see clustering between adjacent spectroscopic and photomet¬ 



18 20 22 24 


r 


Figure 11. Statistics of the photometric vs. spectroscopic red¬ 
shift discrepancy Az as a function of r-band magnitude in the 
CDFS and COSMOS fields. From top to bottom: clipped RMS 
dispersion, outlier fraction, average offset, and fraction of galaxies 
in each given ODDS cut (normalized to the total). 


ric redshift bins. With the exception of the 0.2 < zb < 0.3 
bin, we find non-zero clustering only in matching or adja¬ 
cent bins, implying that the photometric redshift scatter is 
less than the spectroscopic bin width Az = 0.1. This is con¬ 
sistent with the analysis presented in §4.4 which found the 
scatter < 0.08 out to zb = 1.2. 

A correlation between the positions of galaxies in widely 
separated redshift bins would indicate the presence of catas¬ 
trophic errors in the KiDS photometric redshifts. We see 
this to some extent in the non-zero clustering measured be¬ 
tween the 0.2 < Zb < 0.3 and 0.4 < Zspec < 0.5 galaxy 
samples, indicating that a small fraction of the photometric 
redshifts in this bin are actually at a higher redshift. This 
measurement could be used to infer the true redshift distri¬ 
bution of this galaxy sample (see for example McQuinn & 
White 2013, which is beyond the scope of this paper). For all 
other photometric redshift bins, we find the clustering sig¬ 
nal to be consistent with zero for all bin combinations sepa¬ 
rated by Az = 0.1 or more. We can therefore conclude that 
the fraction of ‘catastrophic outliers’ is low, in agreement 
with the direct spectroscopic-photometric redshift compari¬ 
son presented in §4.4. 

We consider this analysis as a validation of our redshift 
estimates. A similar conclusion is drawn from the analysis of 
the cross-correlation between different photometric redshifts 
bins of KiDS galaxies, presented in deJ+15, which extends 
the cross-correlation between bins beyond redshift z = 0.5 
which cannot be probed with the GAMA catalogues. 
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Figure 12. As Fig. 11, but plotted as a function of photometric 
redshift 2 b. 


4.6 The combined shear-photometric redshift 
catalogue 

In §4.4 we defined a photometric redshift selection criterion 
0.005 < 2 b < 1.2 to ensure a good level of accuracy in the 
photometric redshifts. We now combine that redshift selec¬ 
tion with the shape measurement analysis by also selecting 
galaxies with a lensfit weight w > 0 (this cut excludes all 
galaxies for which no shape measurement was obtained, see 
M+13). The upper panel of Fig. 14 compares three redshift 
distributions for this sample of galaxies, showing the distri¬ 
bution of the 2 b point estimates of the photometric redshift, 
and the weighted and unweighted sums of the associated pos¬ 
terior distributions p(z). The weighted distribution, plotted 
as the thick solid line, is the one most relevant for our anal¬ 
ysis: it is the effective redshift distribution of the lensing 
information, and has a median redshift of Zm = 0.53. 

The weights used in the lensing analysis favour higher 
SNR galaxies which are typically at lower redshift in this 
flux-limited survey, and hence the weighted median red¬ 
shift is lower than that of the unweighted sample (which 
has Zm = 0.63). Indeed if the shape measurement criterion 
w > 0 had not been applied, the unweighted median redshift 
would be even higher with Zm = 0.66. This is illustrated in 
the lower panel of Fig. 14, which shows the effective red¬ 
shift distribution for galaxies with different bpz ODDS pa¬ 
rameters: the more precise photometric redshifts, with high 
ODDS, also tend to be at lower redshifts (e.g., the weighted 
median redshift for galaxies with ODDS> 0.9 is 0.43). As 
the ODDS value decreases, so does the accuracy of each indi¬ 
vidual photometric redshift, owing to multiple peaks in each 
galaxy’s posterior distribution that result from degeneracies 
in the redshift solution. In the stacked posterior shown in 
Fig. 14, these degeneracies are responsible for the shape of 
the distribution at the peak. 


Fig. 14 illustrates the importance of using the full poste¬ 
riors p{z) instead of the best-fit photometric redshifts 2 b to 
define the survey redshift distribution. The point estimates 
are more prone to artefacts associated with the particular 
filter set used. They also do not reflect the full information 
content of the photometry. As an illustration of how using 
2 b could bias a lensing analysis. Fig. 15 shows the measured 
angular diameter distance ratio Dis/Da for a lens at red¬ 
shift 2 i = 0.25. Using 2 b or p{z) to determine the redshift 
distribution of the background lensed sources, changes the 
average distance ratio by ~ 10 percent. As the distance ratio 
defines the lensing efficiency of sources at different redshifts, 
using 2 b instead of p{z) would result in an underestimate of 
the lensing surface mass density by ~ 10 percent. 


5 TESTS FOR SYSTEMATIC ERRORS IN THE 

KiDS LENSING CATALOGUE 

Different science cases require different levels of accuracy in 
the shear and photometric redshift catalogues. It is common 
to model calibration corrections to shear measurement in 
terms of a multiplicative term m and additive terms Ck such 
that 

ef= = (l + m)4™ + Cfe, (fc = l,2), ( 18 ) 

where are the observed ellipticity parameters, and 
the true galaxy ellipticity parameters (Heymans et al. 2006). 
Massey et al. (2013) present a compilation of possible sources 
of such correction terms, and calculate requirements on their 
amplitudes for different kinds of analysis. In an ideal shape 
measurement method, both m and Ck would be zero. In real¬ 
ity however, these corrections need to be determined so the 
data can be calibrated, and then systematics tests performed 
to ensure the calibration is robust. 

Our first series of lensing science papers measure shear- 
position correlation statistics, also known as galaxy-galaxy 
lensing, where the tangential shear of background galaxies is 
determined relative to the position of foreground structures. 
As this measurement is taken as an azimuthal average, it 
is very insensitive to additive correction terms Ck except on 
scales comparable to the survey boundaries. It is, however, 
very sensitive to the accuracy of the measured multiplica¬ 
tive calibration m, an error which leads directly to a bias 
in the mass determined from the lensing measurement. Fur¬ 
thermore, these measurements rely on a good knowledge of 
the photometric redshift distribution to determine the level 
of foreground contamination in the background source sam¬ 
ple and hence the level of dilution expected in the measured 
lensing signal. In this section we therefore first describe the 
analysis done to validate the multiplicative calibration m 
used, and then verify that the redshift scaling of the galaxy- 
galaxy lensing signal is consistent with the expectation based 
on the photometric redshift error distributions. 

In this technical paper we also present the first demon¬ 
stration of the suitability of the data for cosmological mea¬ 
surements through two-point shear statistics. Such an anal¬ 
ysis places more stringent requirements on the accuracy of 
the shear catalogue, in particular the additive corrections 
Cfe. We therefore perform an additional set of tests, follow¬ 
ing H-l-12, first selecting fields where the cross-correlation 
between the measured shear signal and the PSF pattern is 
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Figure 13. Angular cross-correlations between KiDS galaxies binned by photometric redshift, and GAMA galaxies binned by spectro¬ 
scopic redshift. Spectroscopic redshifts increase from top to bottom, and photometric redshifts increase from left to right. 


consistent with zero systematics. We then empirically deter¬ 
mine the Cfe terms from the remaining data. 


5.1 Multiplicative calibration 

The multiplicative calibration term m can only be deter¬ 
mined through the analysis of image simulations where the 
true galaxy shapes are known. M-l-13 describe the CFHT 
MegaCam image simulations against which lensfit was cali¬ 
brated extensively in the CFHTLenS analysis. The primary 
aim of these simulations was to correct for noise bias (Hirata 
et al. 2004; Refregier et al. 2012; Melchior & Viola 2012). On 
average the noise bias resulted in a ~ 5 percent correction 
to the measured shear, with more significant corrections for 
smaller, fainter galaxies. This analysis provided a calibration 
correction that depends on the lensHt parameters SNR and 
size rd as 

m(SNR, rd) = ;- ^ exp(-Q rd SNR), (19) 

with a = 0.306 arcsec”^ and /3 = —0.37. 

The r-band KiDS VST-OmegaCAM imaging differs 
from the simulated i-band CFHT MegaCam imaging in a 
few key respects. The pixel scales differ: 6'pix = 0.213'' for 
OmegaCAM and 0pix = 0.186" for MegaCam. The KiDS 
data are shallower than CFHTLenS, and while the mean 
PSF FWHM values for the two sets of leasing data are the 


same (0.64"), the average KiDS PSF ellipticity is ~ 15 per¬ 
cent smaller than the average CFHTLenS PSF. We verify in 
two different ways that this CFHTLenS correction is suitable 
to use for KiDS: (i) using a re-sampling technique such that 
the simulated catalogues better match the KiDS data, and 
(ii) by comparing the galaxy-galaxy lensing signal around 
bright galaxies in CFHTLenS and KiDS for progressively 
fainter source samples. 


5.1.1 Re-sampled image simulations 

Fig. 16 compares the measured properties of galaxies in the 
image simulations from M-l-13 (thin lines) to the properties 
of galaxies in KiDS (thick lines). The upper panels com¬ 
pare the SNR distributions in bins of increasing galaxy size^ 
showing that the image simulations have a deficit of small 
galaxies. M-l-13 concluded this arose from an overestimate 
of the true PSF size when creating the image simulations. 
Compared to the image simulations, which are a good match 
to the SNR distribution of the CFHTLenS data, we also see 
a higher proportion of low SNR galaxies in KiDS. This arises 
because CFHTLenS imposed a magnitude limit i < 24.7 on 

^ In principle this comparison should be made in terms of the rel¬ 
ative galaxy-to-PSF size, but as the KiDS and CFHTLenS imag¬ 
ing have similar seeing distributions we work with galaxy size in 
arcseconds. 


MNRAS 000, 1-37 (2015) 















Gravitational Lensing Analysis of KiDS 17 




Figure 14. The galaxy photometric redshift distribution. Upper 
panel: summed posterior redshift distributions n{z), with (solid 
line) and without (dashed line) weighting by the lensht weight. 
The effective median redshift of the lensing survey is Zm = 0.53. 
The histogram shown in this panel shows the distribution of the 
z^ point estimates of the photometric redshift. Lower panel: the 
lensing weighted posterior n(z) distributions of galaxies in pro¬ 
gressively lower ODDS categories (see text). 


their galaxy sample, based on the depth to which photomet¬ 
ric redshifts were considered reliable. For KiDS, we do not 
include a similar imposed fixed magnitude limit, see Fig. 9, 
as the depth of the survey is within the limits covered by 
deep spectroscopic surveys. 

Comparing the ellipticity distributions as a function 
of galaxy size (middle panels) and SNR (lower panels) in 



Figure 15. Effect of using the full photometric redshift posterior 
p{z), or the point estimate z-q to determine the angular diameter 
distance ratio Di^/Ds for a lens galaxy at redshifts zi = 0.25. 
The average distance ratio D\s/Ds sets the lensing efficiency and 
differs by 10 percent depending which redshift measure is used 
(dashed lines). 
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Figure 16. Comparison of the observed properties of galaxies in 
the image simulations from Miller et al. (2013) (thin lines) to the 
observed properties of galaxies in KiDS (thick lines). The upper 
panels compare the signal-to-noise ratio (SNR) distributions in 
bins of increasing galaxy size (in arcseconds). The ellipticity dis¬ 
tributions can be compared as a function of galaxy size (middle 
panels) and SNR (lower panels). 
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Figure 17. Four panels on the left: KIDS vs. CFHTLenS comparison of the average tangential shear around galaxies with 20 < r < 21 
measured from progressively fainter source populations. The insets show the average multiplicative correction factor, as derived from 
Eq. 19 and applied to the plotted measurements. Right-hand panel: bin by bin comparison of the shear values for 9 between 1 and 20 
arcmin. The red line shows the best-fit linear regression, and the grey zone the corresponding l-cr uncertainty (errors on both axes are 
taken into account). 


Fig. 16, we see an excess of simulated galaxies of large el- 
lipticity in the high-SNR regime. As shown in Viola et al. 
(2014) and Hoekstra et al. (2015), calibration corrections 
can be sensitive to the ellipticity distribution. For the pur¬ 
poses of the analysis of our first 100 square degrees, we re¬ 
sample the simulated galaxy catalogues from M-l-13 such 
that the simulated ensemble galaxy properties match the 
KiDS data in terms of size, SNR and ellipticity. This is pos¬ 
sible as the image simulations from M+13 simulated two 
complete CFHTLenS surveys. Hence while there is a deficit 
of small, low SNR galaxies in the simulations, relative to the 
global populations, there are sufficient numbers with which 
to validate the calibration scheme m from Eq. 19, for KiDS, 
in this under-represented regime. 

We sample galaxies from the image simulations, such 
that the correlations that exist between observed size, ob¬ 
served SNR and observed ellipticity in the data are retained. 
As lensht performs a joint parameter fit of galaxy ellipticity 
and size, selecting galaxies based on their observed size will 
introduce a selection bias on galaxy ellipticity. It is there¬ 
fore critically important not to subject fensfit catalogues to 
any ‘cleaning criterion’, for example rejecting small galaxies 
based on the /ensfit size estimate. Instead we use the /ensfit 
weights to optimally combine the shape measurements. Fol¬ 
lowing M+13 we determine the accuracy of the CFHTLenS 
calibration correction for KiDS by calculating 


Sm 


E.fc [1 + m(SNR, ru)] - efn 


-0.04+0.02, 

( 20 ) 


where the sum is taken over the simulated galaxies i in the 
re-sampled image simulation catalogues, weighted by the ob¬ 
served /ensfit weights Wi, and calculated for both compo¬ 
nents k of the ellipticity. We find that the CFHTLenS cal¬ 
ibration correction underestimates the calibration required 
for KiDS by a few percent®, which is within the current 
statistical error budget for the early science presented in Vi¬ 
ola et al. (2015), Sifon et al. (2015) and van Uitert et al. 
(in preparation). We also verified that this underestimate 
did not vary significantly as a function of galaxy SNR, as 
it arises from the increased fraction of small galaxies in the 
sample. A new suite of KiDS image simulations are in pro¬ 
duction using the GALSIM software (Rowe et al. 2015), in 
preparation for future analyses in which the larger area sur¬ 
veyed will demand a more accurate calibration scheme. 


5.1.2 Galaxy-galaxy lensing at different signal to noise 
ratio: KiDS vs. CFHTLenS 

In this section we apply an additional consistency check to 
confirm the findings of the image simulation re-sampling 
analysis, using real data. We verify that the SNR depen¬ 
dence of the multiplicative calibration is robust by compar- 


® We note an error in the calculation of Eq. 19 used in the first 
KiDS lensing analyses (Viola et al. 2015; Sifon et al. 2015; van 
Uitert et ah, in preparation) that did not correctly account for 
the different MegaCam and OmegaCAM pixel scales. By luck this 
error erroneously increased the average value of m, such that the 
KiDS-correction 5m was reduced to 5m = —0.03 ± 0.02. 
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ing galaxy-galaxy shear measurements from observations of 
different depths. To divorce this test from any uncertain¬ 
ties in photometric redshift, we dehne lens and source sam¬ 
ples purely by r-band magnitude. We then compare the 
dimensionless, m-calibrated tangential shear profile 'yt{d) 
measured with KiDS and with the deeper CFHTLenS data 
(E-l-13). The lens samples are selected with 20 < r < 21, 
and four source samples are selected in half-magnitude bins 
from r = 22 to r = 24. For the brightest sources the average 
calibration corrections from Eq. 19 are only a few percent 
for both surveys, but the faintest bin includes a 14 percent 
calibration correction for KiDS compared to a 4 percent cor¬ 
rection for CFHTLenS. Figure. 17 shows the good agree¬ 
ment between the calibrated KiDS and CFHTLenS tangen¬ 
tial shear profiles, measured between 1 and 20 arcmin, for 
the fonr different source samples. To quantify the consis¬ 
tency we perform a direct bin by bin comparison of the 
measured shears in the right-hand panel of Fig. 17. Fitting 
a simple proportionality relation to the points, using uncor¬ 
related bootstrap errors, as motivated by the results of the 
analytical prescription described in Viola et al. (2015), we 
find a best-£t ratio of (KiDS/CFHTLenS) = 1.05 ± 0.13. 

5.2 Testing redshift scaling with galaxy-galaxy 
lensing 

As objects get fainter, our ability to measure shape, pho¬ 
tometry and photometric redshifts degrades. On the other 
hand the fainter galaxies tend to be at higher redshifts, 
and therefore they experience a stronger lensing distortion. 
Measuring the dependence of the lensing signal with source 
redshift can in principle provide tight constraints on the 
growth of structure and geometry of the Universe. It is there¬ 
fore imperative to perform a cosmology-insensitive joint test 
of the shear-redshift catalogue and determine whether any 
redshift-dependent shear bias exists. In H-l-12 a galaxy- 
galaxy lensing test of shear-redshift-scaling was designed 
that was found to be only very weakly sensitive to the fidu¬ 
cial cosmology assumed in the analysis. The mean tangen¬ 
tial shear yt is measured around a sample of lens galaxies 
for a series of source galaxies split by increasing photomet¬ 
ric redshift, zb- We approximate the mass distribution of 
the galaxies in the lens sample as simple isothermal spheres 
with a fixed velocity dispersion (jy. The predicted tangen¬ 
tial shear around the lens sample i, measured from source 
sample j, is then given by 

Here c is the speed of light, and Dis/Ds is the ratio be¬ 
tween the angular diameter distances from the lens to the 
source, and from the observer to the source. The average 
of this ratio depends on the effective redshift distribution 
of the lens and source sample (see for example Bartelmann 
& Schneider 2001). For a fixed lens sample, we should re¬ 
cover consistent measurements of a^, independent of which 
source sample is used. Any discrepancy indicates either a 
poor knowledge of the photometric redshift distribntion for 
that source sample, a redshift-dependent shear measurement 
bias, or a strong redshift dependence in the velocity disper¬ 
sion (Jv of the lenses within their foreground redshift bin. 

Fig. 18 shows the tangential shear determined at one 



Figure 18. The tangential shear measured at one arcminute as 
a function of the average redshift of the source sample, for two 
samples of GAMA lenses with spectroscopic redshifts between 
0.25 < Zs < 0.5 (filled) and Zs < 0.25 (open). The solid line 
shows the predicted signal from the best-fit SIS model, with the 
dashed lines showing the 68 percent confidence interval. 

arcminute, for source galaxies in seven bins of zb spanning 
0.005 < Zb < 1.5. Two samples of lens galaxies from GAMA 
were used, with spectroscopic redshifts between 0.25 < Za < 
0.5 (filled) and Za < 0.25 (open). 

The solid line connects the predicted signals from the 
best-fit SIS model, assuming a Planck cosmology, taking into 
account the full redshift posterior p(z) for the sources in 
each bin. The amplitude of the model is set by fitting to all 
sources with photometric redshifts 0.2 < zb < 1.0, which is 
considered to be the safest photometric redshift range based 
on the results presented in Fig. 12. The dashed lines show 
the 68 percent confidence intervals on the model amplitudes. 

As expected, the signal increases as the average red¬ 
shift of the source sample increases. We also see that the 
signal and model do not tend to zero for low zb , even though 
the mean source photometric redshift is in front of the lens. 
This is a result of a non-zero fraction of catastrophic out¬ 
liers in the photometric redshift sample that are actually at 
high redshift, causing a signihcant tangential shear signal. 
By taking acconnt of the fnll photometric redshift poste¬ 
rior probability distributions of the sources, the knowledge 
of catastrophic outliers enters the model, generating an up¬ 
turn at low source redshift (note that such 1 ow-2;b galaxies 
which are actually at high redshift do not show up in the 
cross-correlations in Fig. 13 as they fall outside the GAMA 
redshift range). This analysis shows that, within the current 
SNR of the measurement, our shear-redshift catalogue is not 
subject to significant redshift-dependent shear biases. 

5.3 Field Selection for cosmic shear test 

H-l-12 describe a method to identify observations with sig¬ 
nihcant residual contamination of the galaxy shapes by the 
PSF. It involves comparing the correlation between galaxy 
and PSF shape, measured in the data and with mock cata¬ 
logues. As a result 25 percent of the GFHTLenS tiles were 
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flagged as unsuitable for cosmic shear science; nonetheless 
these data could be retained for the galaxy-galaxy lensing 
analyses as the azimuthal averaging renders the measure¬ 
ment essentially insensitive to additive PSF errors. We fol¬ 
low CFHTLenS in not applying field selection for our first 
series of galaxy-galaxy lensing science papers, but repeat the 
H-l-12 analysis on KiDS in order to assess its future compet¬ 
itiveness for cosmic shear science. We summarize the key 
steps of the analysis, and refer the reader to H-l-12 for a 
detailed description. 

The ellipticity estimate for each source can be written 
as 

e°b° = -I- 7 -I- 77 -I- Aaya,i£pSF , (22) 

where is the intrinsic galaxy ellipticity, 7 is the true cos¬ 
mological shear that we wish to detect, and rj is the random 
noise on the shear measurement whose amplitude depends 
on the size and shape of the galaxy in addition to the SNR 
of the observations. The final term reflects residual amounts 
of PSF contamination from the various sub-exposures i that 
‘print through’ to the final galaxy ellipticities. Even though 
the coefficients Aays,i should be very small for good shape 
measurement pipelines, this term can generate significant 
coherent correlations when the shapes of many galaxies on 
the same tile are averaged. 

From a set of N sub-exposures of a part of the sky 
{N = 5 in the case of KiDS r-band data) H+12 define a 
vector of star-galaxy cross-correlation coefficients ^ag, with 
one element per sub-exposure: 

^sg = (e°b'*epsF) = (e'"'‘epgF) + (7 ^psf) + {v ^psf) + CAsys , 

(23) 

where the average is taken over all galaxies in the pointing. 
Here epsF is a vector of PSF ellipticity patterns, one per sub¬ 
exposure, determined from the PSF model at the locations of 
the source galaxies in each sub-exposure. C is a matrix whose 
elements Cij = (epsF^psF ) f^e average covariance of 
PSF ellipticities between the sub-exposures. The complex 
conjugate of the ellipticity is denoted with a *, and only the 
real part of the averages in Eq. 23 is kept (as in Eq. 6 ). We 
have assumed that Agys does not vary across the field of 
view. 

For a sufficiently wide area, the first three terms of 
Eq. 23 average to zero, in which case Asys = C“^^ag. The 
contribution of this systematic ellipticity error to the two- 
point shear correlation function, ) jg then given by 

A^obs = 4JgC-'6g • (24) 

We wish to use ^ag as a diagnostic with which to identify 
those tiles where, for whatever reason, the PSF modelling 
has left significant residuals that would contaminate the 
shear-shear correlation function. The KiDS data are taken 
in square-degree tiles, and on these scales the measurement 
of ^sg will have contributions from the first three noise terms 
in Eq. 23 through chance alignments between the different 
noise, PSF and cosmic shear fields. We therefore estimate 
the expected amplitude of A^obs, a positive quantity, from a 
series of 184 simulated KiDS data sets each containing 109 
systematics-free one-square degree mock catalogues. These 
mock catalogues are populated to match the intrinsic el¬ 
lipticity and measurement noise in the data. A correlated 


cosmic shear signal is also added, drawn from the A-body 
simulations of Harnois-Deraps et al. (2012), following the ef¬ 
fective galaxy redshift distribution n{z) of KiDS shown in 
Fig. 14. Fig. 19 shows the distribution of 5])(A^obs), where 
the sum is taken over all 109 mock fields, for the 184 differ¬ 
ent mock realizations of KiDS. The dashed line shows the 
result we would have obtained if the mock catalogues had 
contained a cosmic shear signal only, to emphasize that the 
two-point star-galaxy cross correlation function will be non¬ 
zero even in the absence of ellipticity noise. We then measure 
the average star-galaxy cross correlation coefficient for each 
field observed, with the result summed over all fields shown 
as the hashed rectangle in the upper panel. The difference 
between the expected result from the mock simulations and 
the data shows that some fields do indeed contain strong 
PSF residuals. To isolate these fields we determine a prob¬ 
ability p for each field that A^obs is consistent with zero 
systematics (see H-t-12 for details). We then set a thresh¬ 
old on this probability such that the data (shown hashed) 
match the expected distribution from the simulations, a re¬ 
quirement met when p > 0.11. We find that this procedure 
rejects only 4 of our fields (3.7 percent, cf. 25 percent for 
CFHTLenS), suggesting that the PSF modelling in KiDS is 
of a significantly higher quality than in CFHTLenS, as could 
have been expected owing to the clean OmegaCAM PSF. 

5.4 Additive calibration correction 

For the data that passed our field selection (§5.3) we mea¬ 
sure the average weighted ellipticity components (£ 1 , 2 ). For 
a KiDS-size survey, in the absence of systematic error, these 
should be consistent with zero. As with the analysis of 
CFHTLenS (H-l-12), we find a small residual shear signal 
in KiDS at the level of ~ 10“^ (shown in Fig. 20). The 
dependence on galaxy size and SNR is different though. In 
CFHTLenS small, high-SNR galaxies were found to be the 
dominant source of the residual signal in (£ 2 ) whereas (£ 1 ) 
was consistent with zero: instead, for KiDS we find that the 
lowest SNR galaxies dominate the residual, which is stronger 
in (£ 1 ). In addition we see a strong dependence of (ci) on the 
Strehl ratio (defined here as the fraction of light in the PSF 
model that falls into the central pixel), which could be a 
sign of error due to undersampling of the PSF. Indeed, with 
typical pixel-to-seeing ratio of 0.25 for CFHTLenS and 0.3 
for KiDS, we expect KiDS to be more prone to such errors. 
Future analyses of KiDS will therefore include a PSF mod¬ 
elling method that correctly accounts for the under-sampling 
(Miller et al. in prep). For this first release, however, we fol¬ 
low the CFHTLenS strategy of calibrating and removing 
this small systematic effect empirically. Note that the first 
lensing analyses are based on tangential shear averages and 
are therefore not affected by such additive errors as long as 
the analysis is not affected by the survey boundaries: for 
the current data set we see no sign of additive effects out to 
projected radii of 2/i“^Mpc (Viola et al. 2015). 

Using all the data that passed the field selection in 
§5.3, we bin the data in three dimensions with six bins 
in size and SNR, and three bins in Strehl ratio, and fit 
a 3D second-order polynomial model to the bins®. Fig. 21 

® For our first set of galaxy-galaxy lensing papers, an earlier ver- 
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Figure 19. Field selection based on the degree of correlation be¬ 
tween the PSF ellipticity pattern and the galaxy ellipticities as 
compared to simulated data. See the text for the definition of 
J^(A^oba)) which quantifies the degree of residual PSF contami¬ 
nation in measurements of the two-point shear correlation func¬ 
tion. The histogram shows the expected range of this statistic in 
simulations, and the hashed region indicates the measured value 
it the l-cr bootstrap error. For comparison the dashed histogram 
shows the expected range for shape-noise free simulations. Top: 
all 109 KiDS fields. Bottom: result of field selection (see text for 
details). 


presents example slices from the data cube and the model 
fit. Applying the c-correction to the shear catalogue changes 
the one-point statistics ((ei,e 2 )) from (-0.0015,-0.0002) 
to (0.0004,0.0004), with a l-cr uncertainty of 0.0003. This 
is sufficiently small that it will not impact the measure¬ 
ment of the two-point shear correlation function presented 
in § 6. This level of residual shear will however impact fu¬ 
ture degree-scale cosmological shear measurements, requir¬ 
ing improvements in the calibration scheme for future data 
releases. 


sion of the additive correction was applied that used a third-order 
polynomial fit to a 3D binning with ten bins on each axis. On 
further inspection this sub-optimal set-up was discovered to in¬ 
troduce a low level of spurious noise into the shape measurement. 
As the shear-position correlations were found to be insensitive to 
the additive correction we only updated the additive calibration 
for the cosmological analysis demonstration in this paper. 


Figure 20. The weighted mean ellipticity components (ei) (left) 
and (e 2 } (right), as a function of PSF Strehl ratio (upper), galaxy 
size (middle) and galaxy SNR (lower). The points are shown be¬ 
fore (open symbols) and after (closed symbols) the empirical cal¬ 
ibration has been applied, with the latter offset horizontally for 
clarity. 

6 COSMIC SHEAR MEASUREMENT 

The measurement of weak gravitational lensing by large- 
scale structure, often referred to as ‘cosmic shear’, has the 
ability to set tight constraints on both standard cosmolog¬ 
ical parameters (see for example Heymans et al. 2013, and 
references therein), and a range of modified gravity scenar¬ 
ios (Simpson et al. 2013; Planck Collaboration et al. 2015b). 
While the amount of data analysed in this paper represents 
less than 10 percent of the final KiDS area, in this section 
we argue that the data quality is at the level that the full 
survey will indeed provide high-fidelity cosmic shear mea¬ 
surements. It also provides a practical demonstration of our 
blinding scheme, which has been designed to counter user 
confirmation bias in future KiDS cosmic shear analyses. 

6.1 Blinding the KiDS weak lensing catalogues 

In the post-Planck precision cosmology era, one the chal¬ 
lenges facing new cosmological observations is confirmation 
bias (e.g., Croft & Dailey 2011). Many new surveys are there¬ 
fore following the approach, particularly favoured by the 
particle physics community, of performing a ‘blind’ analy¬ 
sis. The first stage of such an analysis is the verification and 
validation of software packages through the analysis of mock 
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Figure 21. The measured dependence of (ei) (left) and ( 62 ) 
(right) as a function of SNR, for three different size bins (panels 
upper to lower r = 0.6”, 0.3”, 0.2”), and two different Strehl ratio 
bins with Strehl = 0.05 (open symbols)) and Strehl = 0.1 (closed 
symbols). The corresponding best-fitting models are shown as 
solid (Strehl = 0.05) and dashed (Strehl = 0.1) lines. 

simulated data. The KiDS Al-body simulations span 30,000 
square degrees with a WMAP9 cosmology (Hinshaw et al. 
2013), and are an extension of the suite of lensing simula¬ 
tions described in Harnois-Deraps & van Waerbeke (2015). 
With these simulations we can verify the analysis methods 
for galaxy-galaxy lensing, galaxy-cluster lensing and tomo¬ 
graphic cosmology, and also determine covariance matrices 
for the analysis of the data. 

This mock data strategy does not prevent confirmation 
bias in the analysis of the real data, where potentially un¬ 
known sources of systematic error increase the complexity of 
the analysis. For example, choices are currently made about 
which sub-exposures or pointings to excise from the analysis 
based on the outcome of a range of systematic tests on the 
shear measured in these regions. Choices are also made as to 
which length scales to include in the analysis of correlation 
functions or power spectra, which binning to use, and which 
photometric redshift ranges to trust. It is therefore impor¬ 
tant to build blinding into our data analysis such that these 
choices are informed purely through scientific rationale, and 
not influenced by the results of independent experiments. 

An example of an early blind cosmological data analysis 
is Davis et al. (2007) where the analysis team was given su¬ 
pernova data in which the redshifts had been stretched. This 
strategy of manipulating the data with a small multiplica¬ 


tive perturbation has also been used by other groups, but 
has the drawback that when the data are finally unblinded, 
the analysis has to be re-run. This potentially allows for 
low-level adjustments in the re-analysis, for example choos¬ 
ing which scales to include. We have therefore designed an 
alternative blinding scheme that prevents this, by ensuring 
that the true data are analysed along with the perturbed 
versions. 

All KiDS weak lensing catalogues analysed contain four 
sets of ellipticity data: the true data, and three versions that 
have been manipulated by an unknown amount. Specifically, 
the magnitudes of the ellipticities in column A — 1, 2, 3,4 of 
the catalogues are ‘curved’ with a function 

eA = e (25) 

parametrized by a single value such that Cmax, the maxi¬ 
mum ellipticity in the catalogues, is left invariant under this 
remapping. The values {A:a} are unknown, except that for 
one of them, the true data, A:a is equal to zero. The differ¬ 
ences between the /ca can easily be reconstructed by divid¬ 
ing the shear columns, but this provides no information as 
to which column contains the true ellipticities. The values 
of kA were limited to |fcA| < 0.2, in order to satisfy two con¬ 
ditions. On the one hand, the effect of the transformation 
should be sufficiently large that it effectively blinds KiDS 
to confirmation bias with CMB measurements from Planck, 
by changing the results up to ~ IOct in terms of the Planck 
error on the amplitude of the matter density power spec¬ 
trum (Planck Collaboration et al. 2015a). At the same time 
Eq. 25 must not distort the lensing values to such an extent 
that it creates suspicious effects in galaxy-galaxy lensing, 
ellipticity distributions, SNR or redshift scaling. We asked 
a trusted colleague, external to the team, to set the values 
of kA through a Python executable that takes the original 
lensing catalogues output from lensQt (see §3), manipulates 
the ellipticity columns, according to Eq. 25, and outputs a 
new catalogue with the additional blind columns inserted in 
an order unknown to any member of the KiDS team. 

The team members agreed that they would not wilfully 
unblind themselves by attempting to back-track the data 
manipulation to discover which column contains the orig¬ 
inal data. All analyses are carried out on all four sets of 
columns, including systematics tests, empirical corrections 
and covariance matrix estimation. Different fields may pass 
or fail the systematics tests in different blinded columns and 
this has been taken into account in the final analysis. Even 
though this setup incurred a factor of four increase in the 
computational analysis time, we felt this was a necessary 
step to make, whilst also encouraging the good practice of 
creating, verifying and validating ‘press-of-the-button’ end- 
to-end analysis scripts. In order to allow for phased unblind¬ 
ing, team members add an additional individual layer of 
blinding by not labelling their results with the blinded col¬ 
umn number used. Pre-publication, our results were sent to 
our external who provided the blinding key, and can ver¬ 
ify that the results presented in both this paper, and our 
first scientific analyses (Viola et al. 2015; Sifon et al. 2015) 
were not changed after it was revealed to the authors which 
column contained the true shear. We show an example of 
the blinding scheme in action in the next section, where we 
present the cosmic shear measurement from the four blinded 
shear measurements. 
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Thus far our blinding is limited to the shape measure¬ 
ments only and future blinding will also include manipula¬ 
tion of galaxy weights and potentially photometric redshifts, 
stellar masses and galaxy luminosities. As our first analy¬ 
sis covers less than 10 per cent of the final KiDS area, the 
blinding described here had only a small effect on the early 
science results presented in the accompanying Viola et al. 
(2015) and Sifon et al. (2015) papers. We agreed however, 
that it was important to implement this blinding scheme 
from the beginning, in order to learn from this ‘dry run’ 
in preparation for the future larger-area KiDS cosmological 
analyses. 


6.2 Second order weak lensing statistics 


To detect weak lensing by large-scale structures and extract 
cosmological parameter constraints and information about 
systematics from the data, a wide range of different two- 
point statistics have been proposed (see Schneider et al. 
2002b, 2010, for a comprehensive discussion of the relation¬ 
ship between these statistics). These real-space statistics all 
derive from the observed angular two-point correlation func¬ 
tion which can be estimated from the data as follows: 


i±{d) 


lOgWgWj, [et(a;a)et(a:i,) ± Cx (a;a)ex (a:;,)] 
5D0 WaWb 


(26) 


Using inverse variance weights w, the sum is taken over pairs 
of galaxies with angular separation \xa — xi,\ — 9 i: A6I2, 
where A6 is the width of the bin^°. The tangential and cross 
components of the ellipticities et,x are measured with re¬ 
spect to the vector joining each pair of correlated objects 
(Bartelmann & Schneider 2001). 

Weak gravitational lensing produces curl-free gradient 
distortions (E-mode), and contributes only to the curl dis¬ 
tortions (B-mode) at small angular scales, 9 < I arcmin, 
mainly due to source redshift clustering (Schneider et al. 
2002a). Decomposing the weak lensing signal into E and 
B modes therefore provides a method with which to gauge 
the contribution to the overall shear correlation signal from 
non-lensing sources. These could arise from residual system¬ 
atics in the shape measurement method, or from the intrinsic 
alignment of nearby galaxies (see Troxel & Ishak 2015, and 
references therein). 

Crittenden et al. (2002) show that the shear correlation 
functions, estimated in Eq. 26, can be decomposed into the 
E- and B-type correlators 


^K(.) = 


and 


(27) 


where 

^19 r, rl'j9 

^'(e) = ^_(e)+4| (28) 


The measured E-mode ^e(^) is related to the underlying 




Figure 22. Comparison of the E-type (upper) and B-type (lower) 
shear correlation functions measured using all the data (dashed); 
after the application of the field selection (open points); and after 
the application of both field selection and the additive calibra¬ 
tion correction (solid). Without these two corrections the B-mode, 
which is an indicator of non-lensing systematic errors, becomes 
significantly non-zero on large scales. Note that the B-mode ver¬ 
tical axis has been multiplied by 0 (in arcminutes) in order to 
emphasize the differences from a zero signal. 


non-linear matter power spectrum Ps that we wish to probe, 
via 

j Ml(l) Jo.4 m, (29) 

where Jo, 4 (^ 0 ) is the zeroth (for ^+) or fourth (for ^_) order 
Bessel function of the first kind. Pk,{1) is the convergence 
power spectrum at angular wave number £ 

where a{w) is the dimensionless scale factor corresponding 
to the comoving radial distance w, and wh is the horizon 
distance. The lensing efficiency function q{w) is given by 


q{w) 


SUgQin 

2c2 


/ dw n{w — f - - X 2 , 


(31) 


where n{w)dw is the effective number of galaxies in dw, 
normalized so that f n(w)dw = 1. fK{w) is the angular di¬ 
ameter distance out to comoving radial distance w, Ho is the 
Hubble parameter and the matter density parameter at 
z = 0. For more details see Bartelmann & Schneider (2001) 
and references therein. 


Note that the final reported angular scale of the bin should 
not be the mid-point of angular range selected, but the weighted 
average separation of the galaxy pairs in that bin. 


6.3 KiDS shear correlation data and survey 
parameters 

Fig. 22 presents the derived E- and B-type shear correlation 
functions, from Eq. 27. These were calculated following the 
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Figure 23. The E-type shear correlation functions from the 105 
tiles of KIDS data that pass the PSF systematics tests in §5.3. 
Measurements from all four blindings are shown, with the true 
shear measurement indicated with an error bar. For comparison 
the E-mode signal expected from three different ACDM cosmo¬ 
logical models are shown; Planck cosmology using the TT spec¬ 
tra (dashed), and EE spectra (dotted) along with the best-fit 
CFHTLenS result (solid). Note that the vertical axis has been 
multiplied by 9 (in arcminutes) in order to improve the visualisa¬ 
tion by enhancing the differences. 

method in Pen et al. (2002), using 4000 finely binned mea¬ 
surements of the shear correlation function ^±{0) spanning 
9” < 9 < 4° in equal bins of log0. As our data extend over 
many degrees, but not to infinity, we use a fiducial cosmo¬ 
logical model to determine the integrand in Eq. 28, splitting 
the integrals into two. The first is calculated from the obser¬ 
vations directly, extending from 6 to 0max where Smax = 4°. 
The second extends from 6'niax to oo and is calculated by in¬ 
serting (9) calculated from Eq. 29 assuming the KiDS red- 
shift distribution and the best-fit Planck cosmology (Planck 
Collaboration et al. 2015a). This model dependent part of 
the integrand sums to ~ 10“^ for the three cosmological 
models that are compared in Fig. 23. This model depen¬ 
dence prevents cosmological parameter estimation directly 
from the E-mode signal. The analysis is still a valid diag¬ 
nostic test for residual systematics, however, as the model- 
dependent addition to Eq. 28 is less than 10 percent of the 
total signal on the largest angular scales probed. The er¬ 
rors are estimated following Pen et al. (2002), treating each 
noisy finely binned raw shear correlation measurement as 
uncorrelated with the others. We then propagate these un¬ 
correlated errors through to a final correlated error on the 
coarsely binned E- and B-type shear correlation functions. 
This approximation is sufficient for this diagnostic test as 
the current KiDS area is relatively small such that for the 
majority of scales the data are shot-noise dominated. 

Focussing first on the measured E mode presented in 
the upper panel of Fig. 22, the small effect of removing the 


4 percent of fields that failed the selection stage (§5.3) can 
be seen, as well as the result of subsequent application of 
the additive calibration correction (§5.4). The impact of this 
two-step calibration can also be seen in the B-mode signal 
(lower panel), which is consistent with zero on all scales, 
demonstrating excellent control of systematic errors in shape 
measurement with KiDS. Without the field selection or ad¬ 
ditive ellipticity corrections, however, we find a significant 
B-mode signal on scales 9 > 10'. In preparation for future 
releases we are currently implementing a number of improve¬ 
ments in both the data reduction pipeline, PSF modelling 
and shape measurement analysis which are designed to re¬ 
duce the significance of the calibration corrections on our 
analysis. 

To illustrate how the implemented blinding scheme 
modified the results. Fig. 23 compares the E-mode mea¬ 
sured from all four blindings, with the true shear measure¬ 
ment indicated as the data point with Poisson error bars. 
For comparison the E-mode signal expected from a range 
of ACDM cosmological models are also shown, using the ef¬ 
fective weighted redshift distribution shown in Fig. 14, as 
estimated from the weighted sum of the photometric red- 
shift probability distributions p{z). The three cosmological 
models use the Planck results from table 3 of Planck Col¬ 
laboration et al. (2015a) showing the difference between the 
cosmology fit to the TT spectra (dashed) and EE spectra 
(dotted) along with the best-fit CFHTLenS result (solid) 
from Kilbinger et al. (2013). 


7 CONCLUSIONS 

In this paper we present the first lensing analysis of the 
Kilo-Degree Survey (KiDS) data obtained at the VLT Sur¬ 
vey Telescope (VST) at ESO’s Paranal Observatory. KiDS 
is a multi-band survey specifically designed for weak lensing 
tomography, that takes advantage of the very good image 
quality at the VST. A particular advantage of the VST, 
where the camera operates at an f/5 Cassegrain focus, com¬ 
pared to much faster wide-field prime-focus cameras, is the 
simplicity and generally low amplitude of the ellipticity pat¬ 
tern, as well as the uniformity of the size of the point spread 
function (PSF) over the full field of view. 

The KiDS lensing analysis draws heavily on heritage 
from the CFHTLenS project (Heymans et al. 2012), in par¬ 
ticular in the use of Theli (Erben et al. 2013) and lensSt 
(Miller et al. 2013) for measuring galaxy shapes (§3), and 
BPZ (Benitez 2000) for photometric redshifts (Hildebrandt 
et al. 2012). As input for the photometric redshifts, aperture- 
matched colours are derived from PSF Gaussianization of 
the public data release of the Astro- WISE reduction of the 
KiDS images (de Jong et al. 2015), and subsequent Gaussian 
Aperture and PSF (GAaP) photometry. This procedure, 
which was developed specifically for KiDS, is described in de¬ 
tail in §4 and Appendix A. The resulting shear/photometric 
redshift catalogues are available to the community (Ap¬ 
pendix G), and form the basis of three companion scientific 
analyses Sifon et al. 2015; Viola et al. 2015; van Uitert et al., 
in preparation) that exploit the overlap of these data with 
the GAMA spectroscopic survey (Driver et al. 2011). The 
KiDS lensing catalogues contain 8.88 galaxies per square ar- 
cminute with non-zero lensing weight, cover an unmasked 
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area of 75 square degrees, and provide an inverse shear vari¬ 
ance of 69 per square arcminute. The median redshift of the 
summed posterior photometric redshift distributions of the 
galaxies, accounting for the lensSt weight, is 0.53. 

Considerable attention was paid to quantifying and cor¬ 
recting the lensing estimates for additive and multiplicative 
bias. In order to validate the galaxy ellipticities, we car¬ 
ried out extensive tests (§5). All indications are that the 
data are indeed ‘lensing-quality.’ For example, the degree of 
star-galaxy shape correlation in the KiDS data is essentially 
consistent with the expectations from realistic simulated 
cosmic shear fields, with just 4 percent of the tiles falling 
outside expected parameter ranges, and the amplitude of 
galaxy-galaxy lensing around magnitude-limited foreground 
lenses scales in the same way as it did in CFHTLenS even 
though the depths of the surveys differ. Taking advantage 
of the GAMA overlap, we also tested the way the tangen¬ 
tial shear around galaxies at known (spectroscopic) redshift 
scales with the (photometric) redshift of the sources. Also 
here we recover the expected dependence, which gives us 
confidence in both the photometric redshifts and the shears 
we measure. 

Finally, in §6 we present a first measurement of the cos¬ 
mic shear correlation function from these data. Though ad¬ 
mittedly still noisy, the results are consistent with previous 
measurements, and show negligible B-mode signal, demon¬ 
strating the high fidelity of the KiDS lensing data. 

KiDS observations continue at the VST, and as the area 
of the survey grows more refined cosmological lensing mea¬ 
surements will follow. 
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APPENDIX A: PSF GAUSSIANIZATION AND 
gaap photometry 

In this Appendix we describe the processing steps involved 
in the PSF Gaussianization, and the subsequent Gaussian- 
aperture photometry. The aim is to obtain multi-colour pho¬ 
tometry that is insensitive to difference in PSF between the 
different bands, yet optimizes SNR by down-weighting the 
outer parts of the images. It is important to stress that 
the resulting fluxes do not provide total magnitudes of the 
sources; they are mainly intended for consistent estimation 
of the spectral energy distribution of the high-SNR parts of 
a source. 

The procedures presented here build on the ideas pre¬ 
sented in Kuijken (2008), but differ in that here (i) an ex¬ 
plicit pixel-space PSF Gaussianization convolution is per¬ 
formed, and (ii) the aperture shape and size can be specified 
independently of the PSF size. 

Al PSF Gaussianization 

We start from a co-added image of a KiDS tile, observed with 
a particular filter, and construct a convolution kernel that 
modifies the PSF everywhere so as to make it a circularly 
symmetric Gaussian. The first step is therefore to model the 
PSF from the many star images that are found on every 
KiDS tile. 

To describe the PSF we model the stars with shapelet 
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expansions following Refregier (2003), and use these to con¬ 
struct and apply a suitable spatially varying convolution 
kernel. Shapelets are Gaussians multiplied by polynomials 
and form elementary, compact, orthonormal 2D functions 
which can be used to ht an image to arbitrary precision. 
The shapelet with scale radius P and Cartesian orders (a, b) 
is (for a, & = 0,1, 2,...) 


x^+y^)l2l3^ 

/3'\/2“+*’7ra!b! 


(Al) 


where Ha(x) is a Hermite polynomial, familiar from the 
eigenstates of the quantum harmonic oscillator. Many use¬ 
ful properties of shapelets, such as their behaviour under 
inhnitesimal translation, rotation, magnification, shear, and 
convolution can be derived (Refregier 2003), and are used 
below. We use the implementation of shapelets of Kuiiken 
(2006). 

Among many applications, shapelet decompositions 
have been used to characterize the galaxy populations (e.g., 
Kelly & McKay 2004; Melchior et al. 2007), to measure 
weak lensing shear (e.g., Refregier & Bacon 2003; Kuijken 
2006) and flexion (e.g.. Bacon et al. 2006; Massey et al. 
2007; Velander et al. 2011) and for PSF-corrected photom¬ 
etry (Kuijken 2008). Even though the shapelets are for¬ 
mulated in Cartesian coordinates, truncating the expan¬ 
sion at maximum combined order N = a -\- b results in an 
orientation-invariant subspace of possible shapes that can 
be described. Such truncated shapelet expansions are most 
effective at modelling structure at radii between p/y/N -\- 1 
and P X \/N + 1. At large radii they asymptotically approach 
a Gaussian. 

Because of the orthonormality of the elementary 
shapelets, any source image I(x, y) can be described as a 
sum J2ab J/) with coefficients 


Sab 


I dxdyI{x,y)S^^,{x,y). 


(A2) 


Since our data are pixellated we do not use this integral 
relation, but rather make a least-squares fit of a truncated 
shapelet model to the image of our source. We fit all pix¬ 
els within a radius (4 -|- v7V)/3 of the centre of the source. 
Typically we truncate the series at = 10. 

Using this formalism, our ‘PSF Gaussianization’ proce¬ 
dure, which we apply to each survey tile and filter separately, 
is as follows. 


A 1.1 Fit shapelet models to all stars 

First we identify high-SNR unsaturated stars in the im¬ 
ages, using the traditional flux vs. radius plot (Kaiser et al. 
1995) obtained from a SExtractor run on the individual 
sub-exposures. Typically several thousand such stars can be 
found per tile. For each GCD the PSF size (FWHM) is av¬ 
eraged over the four or five sub-exposures through a given 
Hlter, and these 32 values are fitted with a second-order 2-D 
polynomial to give a rough map of the average seeing for the 
tile. The scale radius /3p for the shapelet model for the PSF 
is then Hxed at (1.15/2.35) x the largest FWHM value in 
this map (this choice makes the FWHM of the Soo shapelet 
15 percent wider than the PSF, enabling the higher-order 
terms to fit the inner structure of the PSF as well as the 


wings. It will also be the size of the target Gaussian PSF, 
see below). 

Once Pp is chosen for a particular tile and filter, we 
determine shapelet parameters up to order a -|- fo = 10 for 
each star using least-squares fitting (66 coefficients per star). 
This truncation order is set by the pixel size, which is typi¬ 
cally about one-third of the scale radius. All shapelet models 
use Gartesian pixel coordinates x = (a;, y) with respect to a 
centre position ^ = (^, rj) of the star: we define this centre 
as the position for which sio = soi = 0 and determine it 
iteratively. 


A 1.2 Interpolate the PSF model 

The PSF models are then interpolated across the image by 
means of 4th-order polynomial fits of each shapelet coef- 
hcient versus ^ and y position (15 spatial variation coeffi¬ 
cients per shapelet term). In this step outliers are rejected 
iteratively, resulting in a smooth model of the PSF varia¬ 
tion across the image. The PSF model for a particular co¬ 
added image is thus described as a linear combination of 
15 X 66 = 990 terms Sab;ki’- 

a + b<10 

P{x,y,^,y)^ ^ Sab{i,n)sll{x,y), (A3) 

a .= 0;&=0 

where 

fc+Z<4 

Sah{i,n)= ^ Sah-ki{i-^o)^{'n-r]oy (A4) 

fc=0;Z = 0 

and Go,rjo) is the centre of the image. 


A 1.3 Construct the Gaussianization kernel 

Once we have a map of the PSF, the next step is to con¬ 
struct a convolution kernel that renders the PSF Gaus¬ 
sian. Also here the shapelet formalism is convenient, since 
shapelets behave nicely under convolution. Refregier (2003) 
provides recurrence relations for calculating the matrix el¬ 
ements that express the convolution of two basis 

functions as a new shapelet series of arbitrary scale radius 
Po: 

OO 

= E (A5) 

e,/ = 0 

The convolution of an image with shapelet coefficients Sab 
and scale radius P with a PSF that has shapelet coeffi¬ 
cients Pab and scale radius Pp can then be written as a new 
shapelet, with scale radius Po and coefficients 

{p(S)sUa = E ( E Cif/^Pad j Sab = E Prr>-r.abSab , , 

ab \ cd / ab 

(A6) 

where the expression in brackets, the ‘PSF matrix’ P, gives 
the linear transformation from pre- to post-convolved source 
coefficients. Note that the sum in Eq. A6 runs over an inhnite 
number of terms, so that the series needs to be truncated in 
practice. 

Given a PSF, the corresponding Gaussianization kernel 
is a shapelet which, when convolved with the PSF, gives the 
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Figure Al. Illustration of the PSF Gaussianization and homogenization, for a KiDS field with significant PSF variation. The PSF 
model is shown on the left, the Gaussianization kernel in the middle, and a comparison of the radial profiles of the resulting convolved 
PSF (red circles), the target Gaussian (line) and the original PSF (blue dots) on the right. The top row of plots show the PSF in the 
centre of the field, where it is rather round; the second row shows a much more elliptical, and wider, PSF in one of the corners. The 
colour scale is the same for each PSF, running from zero (white) to the peak value in the top panel (dark green). Each kernel has been 
normalized separately to its peak value. Green pixels are positive; the red scale runs from 0 to —5 percent of the peak, to highlight the 
mildly negative regions of the kernel. 


target Gaussian (for which soo = and all other 

coefficients are zero): i.e., we seek those shapelet coefficients 
kab for which 


a.+6<iV;j 

E - 


mnab^ab 


kab 



= 0 


for m + n < N. (A7) 


In order to regularise the calculation of the kernel we trun¬ 
cate kab at shapelet order Nk = 8, whereas the PSF is mod¬ 
elled to order N = 10. This over-constrained set of equations 
for the kab is then solved by minimizing the residuals in the 
least-squares sense. Kernels are constructed in this way for 
121 {^,rj) positions on an 11x11 grid covering the image, 
explicitly normalized to unit integral, and their coefficients’ 
variation across the image fitted with a 5th order polyno¬ 
mial K{x,y,^,ri) (see Eq. A4). This spatially varying kernel 
map is now ready for convolution with the co-added image. 

We set the dispersion of the target Gaussian PSF to be 
equal to the scale radius fip of the shapelet expansion of the 
PSF. We have found that this choice preserves the seeing of 
the image, while avoiding significant deconvolution (and the 
associated noise amplification) anywhere on the image. As a 
final check, we make sure that the peak of this target PSF 


is everywhere lower than 90 percent of the central height of 
the original PSF, and if it is not, we increase the value of /3p 
until it is. An example of the PSF model and the effect of 
the corresponding kernel is shown in Fig. Al. 

Note that this smooth model for the PSF (and hence 
also the kernel) variation across the co-added image does not 
allow for discrete jumps in the PSF, such as might be caused 
by seeing variations between the dithered sub-exposures, or 
misalignments of adjacent chips. While we have no indica¬ 
tions that such effects adversely impact the quality of the 
photometric redshifts, in future releases we plan to construct 
separate Gaussianization kernels for each sub-exposure, and 
to apply these before co-addition. 


AI .4 Image convolution 

Convolving the co-added images by the kernel is mathemat¬ 
ically most convenient in Fourier space, especially because 
of the shapelet formulation: shapelets have simple Fourier 
transforms which can straightforwardly be multiplied into 
the Fourier transform of the image. Since the kernel is com¬ 
pact we can speed up the convolution by splitting the image 
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into smaller segments and processing these separately. In 
practice we split the (over 18000x18000 pixel) co-added im¬ 
ages into segments of 512x512 pixels, which appears to be a 
reasonable optimum on most machines. This step can also be 
parallelised. Each segment I{x,y) is background-subtracted, 
edge-tapered to zero and padded around the edges, with 
the width W of the edge set to four times /3p (see §A1.3). 
This preprocessed segment is then convolved with the ker¬ 
nels Khi, Ktr, etc., at the bottom-left, top-right, etc., corners 
of the segments, and the four convolutions are averaged as 

G{x,y) = {1 - t){l - u){I ® Khi) + {I - t)u{I ® Kti) 

+t{l — u){I ® Khr) + tu{I ® Ktr) , (A8) 

effectively convolving I with a bilinear interpolation of the 
kernel map. (Here t and u run from 0 to 1 along the x and 
y axes of the segment, respectively). The edge of the result¬ 
ing convolved segment is then trimmed by a width 2W, the 
background is added back in, and the pixels are pasted into 
the final output convolved image. 

To prevent numerical issues, pixels that deviate greatly 
from their neighbours are clipped before the image segment 
is Fourier transformed and convolved; copies of the kernel of 
appropriate amplitude are instead added into the convolved 
image at the locations of the clipped pixels. 

A1.5 Noise propagation: noise auto-correlation function 

Convolution of an image correlates pixel noise, and this 
needs to be taken into account in measurements based on 
the Gaussianized co-adds. If G(^) is the result of convolving 
image !{$,) by a kernel K{x), the covariance C'^ between 
the errors on two pixel values G(^) and G{^ S) is 

Cov(y^ dxI{x)K{i- x), j dyI{y)K{i + S - y'^ 

= jj Cov{I{x),I{y))K{^-x)K{^ + d-y). (A9) 

Our kernels are compact, and for all but the brightest 
sources the pixel noise in the original images is background- 
limited and slowly-varying. We can therefore assume that 
the covariance between neighbouring pixels depends mostly 
on the vector distance between those pixels and write 

Cov {I{x),I{y)) ~ G\x - y) = G\y - x). (AlO) 

(Note that strictly speaking the pixel values in the image are 
I[x) times the area of a pixel; for clarity we assume without 
loss of generality that this area is equal to one.) 

Writing x = y — x', y = y' — and using the symmetry 
of G^ (Eq. AlO) we obtain 

G'^(d) = jjdx'dy'GGx')K{x'-y')K{S-y') 

= J dx'C\x'){K*K){S-x'), (All) 

where K*K denotes the autocorrelation function (ACF) of 
the kernel K. The noise covariance matrix G^ of the con¬ 
volved image is therefore the convolution of the original co- 
variance matrix G^ with K*K. In our application the kernel 
ACF is calculated analytically using the shapelets formal¬ 
ism. As the shape of the kernel varies across the image, so 
does its ACF: we model the kernel ACF as a shapelet map 


(see Eq. A4) of the same order as the kernel, with scale 
radius /3p%/2. 

Even before Gaussianization, neighbouring pixels in our 
incoming co-added images have non-zero covariance as a 
result of re-gridding after astrometric calibration, particu¬ 
larly since our images are resampled on to slightly finer 
pixels (0.2") than the native 0.213” scale of OmegaCAM. 
The SwARP code (Berlin et al. 2002) that does this resam¬ 
pling and co-addition produces a weight image that gives 
the inverse variance on each pixel value, but does not report 
covariances. Rather than calculating G^ analytically using 
the form of the Swarp interpolation kernel, we estimate the 
covariances numerically from the images themselves. An ex¬ 
act measurement is not critical here because in any case the 
dominant covariance contribution in Eq. All comes from 
the Gaussianization kernel. 

Assuming that our incoming co-added frame I con¬ 
sists of roughly Gaussian background noise plus positive 
sources, we estimate the shape of its covariance matrix, 
C^(5)/G^(0), from the statistics of pixel values which are 
likely to be source-free. The amplitude will vary across the 
image as a result of variations in background, dither pat¬ 
terns, CCD sensitivity variations etc., but we assume that 
the shape of G^ is constant. The amplitude G^(0) is set by 
scaling G^ so that its peak value agrees with the Swarp 
weight image value at the centre of the source under consid¬ 
eration. 

The noise variance G^(0) of the incoming co-added 
frame is estimated empirically from the statistics of ‘source- 
free’ pixel values, which we select as those below the median 
(background) level B. We estimate the width of the distribu¬ 
tion from their median difference from B, dividing by 0.675 
to obtain a standard deviation estimate d assuming these 
statistics to be approximately Gaussian: 

C^(0) ~ , where d = m/0.675 (A12) 

and m is the median of |/(4) —B| for those pixels with 1 < B. 

The covariance G^(<5) between pixel values separated 
by S is similarly derived from the distribution of values of 
7± = I (/(^) ± -b d)). We obtain 

(S) ct: dl - d^_ , (A13) 

where d± = m±/0.675, and the m± are the medians of 
|/± — B\ for those pixel pairs with 7+ < B. 

These estimated covariances are then modeled as a low- 
order shapelet, and convolved with the ACF of the Gaussian¬ 
ization kernel (Eq. All) to give the total noise covariance 
matrix. 

A 1.6 Residual correction 

Because the shapelet description of the original PSF is not 
perfect - in particular, shapelets asymptotically drop off 
faster than real PSFs - some residual star flux remains at 
large radii after the convolution with the Gaussianization 
kernel. To correct for this, the Gaussianized co-added frames 
are improved further by means of a ‘tweaking’ step. To this 
end, for each star on the Gaussianized image we measure 
the difference AP{x) = P{x) — G{x) of its flux-normalized 
image P to the target Gaussian G. We then fit this residual 
PSF using a spatially variable shapelet expansion as above, 
but with a scale radius that is a factor of two larger than 
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Figure A2. The effect of the residual PSF non-Gaussianity cor¬ 
rection step described in §A1.6. The panels in the top row show 
the residual PSF flux around a bright star after PSF Gaussian- 
ization, both as an image (left) and with pixel values plotted as 
a function of distance from the centre of the star (right). For ref¬ 
erence the target Gaussian PSF profile is plotted as a solid line, 
scaled down by a factor of 100. The bottom row shows the same 
star after the correction step, and demonstrates that the large- 
radius residual flux is removed effectively by this perturbative 
procedure. 


before in order to capture more of the large-scale flux into 
the PSF model. The Gaussianized image is then convolved 
with a kernel (1 — AP (8)“^ G) that corrects, to first or¬ 
der in AP, this residual non-Gaussianity of the PSF. Here 
indicates deconvolution by the target Gaussian PSF 
G of dispersion /3p, which can be done analytically since AP 
is expressed as a shapelet series with scale radius > fip. 
Fig. A2 shows that this process works well. 


A2 GAaP photometry 


Once the PSF is standardized to a Gaussian, it is possible 
to perform aperture photometry in such a way that the an¬ 
swer is independent of the seeing. The central result of this 
‘Gaussian Aperture and PSF’ photometry is the relation be¬ 
tween the pre- and post-seeing Gaussian-aperture weighted 
flux when the PSF is a Gaussian of dispersion p: 


da; 


1 




27rp^ 

det(W) 




— f dx 7(a:)e-5“’''('"+^'^)”'“. (A14) 

1)2 J 


det(W-bp2i) 


Here the expression in square brackets is the PSF-convolved 
image of a source with intrinsic, pre-seeing surface brightness 
distribution I{x), and W is an ‘aperture matrix’ that defines 
the shape, orientation and size of the Gaussian aperture. For 
a Gaussian aperture of major and minor axis dispersion a 
and b, with the major axis oriented an angle a from the 
x-axis, we have 

2 2 I l 2 ’2 / 2 t.2\ • 

a cos a + b sm a (a — b ) sm a cos a 
(a — b ) sm a cos a a sm a -|- o cos a 




Eq. A14 shows that, when the PSF is Gaussian, a 
Gaussian-weighted aperture flux on the PSF-convolved im¬ 
age with aperture matrix W can be related directly to the 
intrinsic Gaussian-weighted aperture flux of the pre-seeing 
image with aperture matrix W -l-p^l. Gonversely, photom¬ 
etry for the intrinsic source with aperture matrix Wint is 
simply done by applying the aperture matrix Wint — to 
a Gaussianized-PSF image, independent of the PSF size p it 
was convolved to, and normalizing appropriately. We there¬ 
fore define the GAaP flux Fw with reference to an aperture 
matrix W on the pre-seeing image I{x), but measure it from 
a PSF-Gaussianized image G{x): 

Fw = J da; I{x)e~^'^ ™ °° 

= /da; G(a;)e-i“^('"-^') 

det(W — p‘^1) 2 J 

For our variable-seeing, multi-band dataset, GAaP photom¬ 
etry therefore provides a way to obtain fluxes which pertain 
to the same part of the galaxy at all wavelengths, from which 
true colours can be derived. 

Since aperture photometry is a linear combination of 
pixel values it is straightforward to take account of the noise 
covariance G^ (see §A1.5) of the Gaussianized image when 
determining the uncertainty on the fluxes derived. The vari¬ 
ance on a GAaP flux measurement is given by 


(ine) 


Var(Fw) 


det(W — p^l )2 J 


(A17) 


With some work it is possible to take advantage of the 
shapelet formulation of G® in this equation to do the in¬ 
tegral analytically. 

We use a dedicated code for the GAaP photometry. 
It first reads the PSF-Gaussianized image, the noise auto¬ 
correlation function map, the weight map corresponding to 
the original co-added image, as well as a list which contains 
each source’s position together with the axis lengths (a, b) 
and position angles a of the corresponding pre-seeing aper¬ 
tures (all in world coordinates). It then transforms these 
to pixel coordinates, performs the aperture photometry ac¬ 
cording to the second line of Eq. A16, and calculates the 
error bar following eq. A17. The same input list is used for 
runs on the images obtained for the tile with different fil¬ 
ters. These multi-band fluxes and corresponding errors are 
then fed into bpz for photometric redshift estimation, and 
merged into the master SExtractor and LENSfit catalogue 
of the tile. 

While for isolated sources any aperture matrix can be 
used to obtain unbiased estimates of the GAaP flux, the op¬ 
timal SNR is obtained when the shape and orientation of the 
post-seeing aperture defined by W — p^l is matched to the 
source. It turns out that the SNR of the flux has a rather 
broad maximum as a function of aperture size (Fig. A3), 
with the details depending on the exact Gaussianization ker¬ 
nel at the location of the source. This means that it is pos¬ 
sible to select a compromise aperture which yields nearly 
optimal SNR in all bands even if these have different see¬ 
ing. After some experimentation we chose an aperture Wint 
which is derived from the SExtractor RMS axis lengths of 
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Figure A3. GAaP apertures on an r ~ 20 galaxy. Top: co-added 
image, with 2" X 0.99” (1-cr) aperture overlaid. Middle: same, but 
PSF-Gaussianized image is shown. Bottom: Signal-to-noise ratio 
(SNR) for different circular apertures (solid curve) and for the 
aperture shown in the other panels (star symbol), for a source as 
the one illustrated but ten times fainter. The Figure illustrates 
the broad maximum of the SNR curve as well as the enhancement 
that follows from approximately matching the aperture size and 
shape to the source. 


the source on the r-band image, where we have the best see¬ 
ing, by adding 0.7" in quadrature to the axis lengths a and 
b. We keep the position angle a aligned with the major axis. 
This choice ensures that also poorer-seeing observations in 
the other bands (chiefly u or i) can still be photometered 
with reasonable SNR. To prevent overlaps with neighbour¬ 
ing sources, we maximize a and b at 2". 

It is important to realize that these GAaP aperture 
photometry values are not total fluxes, but rather the flux 
inside a well-defined tapered aperture. So while they are em¬ 
inently suited to colour measurements, they do not replace 
total magnitudes. The exception is very compact sources 


and stars: in these cases the intrinsic aperture function 
exp(—|a;^WFja:) is equal to 1 over the area where the 
source contributes flux, and hence the GAaP flux does equal 
the total flux. 

Note that even though, for practical reasons, we carry 
out the photometry as a two-step process (first manipulat¬ 
ing the pixels in the image and then photometering the re¬ 
sult) it can also be written as a single photometry step on 
the original variable-PSF stack (albeit with a complicated 
aperture function, given by the convolution of the Gaussian- 
ization kernel with the Gaussian aperture function). Effec¬ 
tively, therefore, our photometry is still a linear combination 
of calibrated pixel fluxes, with a tractable error analysis. 

While here we restrict our use of the Gaussianized im¬ 
ages to the aperture photometry, they can also be used 
for other aspects of the analysis, such as improving star- 
galaxy separation (Pila-Dfez et al. 2014), measurements of 
galaxy morphology parameters, PSF-corrected galaxy el- 
lipticity measurements, or generating multi-colour images 
without colour-dependent PSF effects. 


APPENDIX B: QUALITY CONTROL 

B1 Object detection and photometry 

Once an object catalog is available, we inspect and verify 
the data quality with a ‘check plot,’ generated for each tile. 
Fig. B1 shows an example for one of the KiDS helds, con¬ 
sisting of a collection of twelve plots. The top row shows the 
masked distribution of galaxies (left) and the angular corre¬ 
lation function w{0) for a subset of these objects (right). The 
angular correlation function plot contains data for the given 
field (red), overlaid with those from other tiles within the 
same region of the survey (dotted black). Discrepant corre¬ 
lation functions are a sensitive indicator of spurious object 
detection and surface density inhomogeneity; the combina¬ 
tion of the two plots in the top row thus flags cases where 
additional masking may be necessary. 

The three plots in the second row show the stellar 
distribution (red crosses) and the distribution of galaxies 
with valid shape measurements in grey scale (left); the PSF 
whisker plot from the co-added image (centre), and the r- 
band extinction map (right). The plot on the left allows 
a first check of the star selection: sufficient stars must be 
available over the entire unmasked area for reliable PSF esti¬ 
mation (and hence shape measurements). The PSF whisker 
plot is that of the co-added image, and will indicate any 
issues with astrometry or the star-galaxy separation (the 
slight star-galaxy contamination seen in Fig. B1 is accept¬ 
able). The extinction map allows additional diagnosis of the 
main photometry checks below. 

The bottom two rows contain the main photometry 
checks. The two left-most plots in the third row contain a 
pair of colour-colour diagrams of the stars (grey scale), with 
the predicted stellar loci calculated from Pickles standard 
star SEDs overlaid (red crosses). An error in the zero-point 
magnitude can be detected by a shift in the stellar loci, 
while a broadening or lack of stellar loci indicates a problem 
with the star-galaxy separation. The right-most plot in this 
row shows the photometric redshift distributions for bright 
(blue) and faint (cyan) objects, both for the photometric 
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redshift point estimates zb (histograms) and the stacked 
p{z) probability distribution (smooth curves). The photo¬ 
metric redshift distribution is sensitive to zero-point errors 
in photometry. The bottom row shows the galaxy number 
density per magnitude dn/dm in the four survey bands ugri. 
Similarly to the w{6) plot, these distributions (shown in red) 
are compared to those of neighbouring tiles, plotted with 
dotted black lines. A horizontal shift of dn/dm indicates a 
possible zero-point error, while a vertical offset signihes an 
unusually low number count, most likely the result of a pro¬ 
cessing failure. 

B2 PSF modelling 

Fig. B2 shows an example check-plot for one of the KiDS 
helds, designed to inspect the quality of the PSF modelling 
for the five sub-exposures of each field (each row displays 
the data for each sub-exposure). There are four panels from 
left to right. The hrst panel shows the PSF model, where 
the length and colour of the tick mark shows the direction 
and strength of the PSF ellipticity (defined in Eq. 4). The 
second panel shows the residual PSF ellipticity 5epsF, as de- 
Hned under equation 5. Here the range of the ellipticity-scale 
is reduced to enhance the residuals. The example shown is a 
typical KiDS observation revealing a rather noisy PSF resid¬ 
ual, which at first sight might not appear useful. This type 
of visualisation does however reveal the rare occasions where 
a poorly constrained polynomial model of the PSF (for ex¬ 
ample in a heavily masked region of the image) becomes 
ill-behaved. The two right-hand panels compress the infor¬ 
mation in the two left-hand panels using the two-point PSF 
ellipticity correlation function, where the estimator is given 
in Eq. 26. Between the dotted lines, at ^ = 2.10“® (left) and 
10“® (right) the log-log scale becomes log-linear; the dashed 
line signifies zero correlation. We see that the model accu¬ 
rately predicts the amplitude and angular dependence of the 
two-point PSF ellipticity correlation function, and that the 
PSF distortion for KiDS contributes predominantly to the 
correlation, only signal leaking into the correlation 
on large scales. The final panel on the right shows the two- 
point PSF residual ellipticity correlation function which is 
two orders of magnitude lower than the PSF distortion and 
within the requirements for a future cosmological analysis of 
the full KiDS area, as discussed in §3.2.2. 


APPENDIX C: KiDS SHEAR AND 
PHOTOMETRIC REDSHIFT CATALOGUES: 
SOURCE LIST COLUMNS 

The following table lists the columns that are present in 
the KiDS-DR2 shear and photometric redshift catalogues 
that are made publicly available to download from http: 
//kids.strw.leidenuniv.nl. We provide three catalogues, 
one for each GAMA held G09, G12 and G15, in ldac-fits^^ 
format. The survey conhguration is shown in hg. 1 of Viola 
et al. (2015). Please note that the sources and masks in 

LDAC tables can be read with most FITS reading tools, or 
with specialized Theli tools from http://marvinweb.astro. 
uni-bonn.de/data_products/THELIWWW/ . 


these catalogues do not correspond exactly to those in the 
ESO public data release catalogues presented by dej-|-15, 
who did not use the Theli reduction that is the basis of 
the lensing catalogues described here. We recommend that 
users apply the following selection criteria on the masks, 
the lensdt weight, SNR, and the bpz photometric redshift 
in order to measure a robust lensing signal: MAN_MASK = 0, 
weight > 0, 0.005 < z_B < 1.2 and SNratio > 0. Objects 
that appear in overlapping tiles have already been removed 
from these region compiled catalogues. Redshift distribu¬ 
tions should be calculated from the full photometric red¬ 
shift probability distribution p{z), labelled PZ_full. Multi¬ 
plicative, m_cor_best, and additive calibration corrections, 
cl_best and c2_best, should also be applied to the shear 
estimates. 

In the following we provide additional information on 
certain columns in the catalogue. 

• KIDS_TILE is the name of the KiDS survey tile the 
source falls on. Searching the ESO archive with this OBJECT 
name will link to the Astro- WISE images of this tile. 

• THELI_NAME gives the Theli name of the KiDS survey 
tile. For scripting reasons these names replace the and 
characters with ‘p’ and ‘m’, respectively. 

• MAG_GAAP_f_CALIB gives the GAaP aperture flux in fil¬ 
ter f (for filters u, g, r and i), cross-calibrated to the SDSS 
photometry (S4.3). For each source these photometric pa¬ 
rameters are given for a single aperture, specified by the 
major and minor axis lengths given in the Agaper, Bgaper 
and PAgaap keywords. The fluxes are measured from the 
Astro-WISE co-added u-, g-, r- and i-band images, using 
the sky positions from the Theli r-band source catalogue. 
Note that these aperture magnitudes are mainly intended 
to be used for colour measurements, since they refer only to 
the central regions of the source. They are not total mag¬ 
nitudes except in the case of unresolved sources (see § 4.2 
and A2). Approximate total magnitudes may be obtained 
by taking the MAG_AUT0 magnitude, which is based on the r- 
band images, applying the extinction correction, and adding 
the GAaP colours. 

• Flag_GAAP_f is set to 100 when no GAaP flux could be 
measured in band f, 0 otherwise. 

• PZ_full: This is the full photometric redshift proba¬ 
bility distribution p(z) from 0.005 < z < 3.505. There are 
70 columns sampling p{z) at intervals of dz = 0.05. The 
hrst bin is centred at 2 = 0.03 and we recommend a linear 
or spline interpolation between the mid-points of each bin 
to recover a smooth redshift distribution from a sample of 
galaxies. There is a hnal bin not included in these catalogues 
with z > 3.505, such that in a small number of cases p(z) 
will not sum to 1. In a lensing analysis it is highly unlikely 
that a galaxy at such a high redshift would have a plausi¬ 
ble shape measurement, and we recommend applying a hard 
prior of zero probability past 2 : > 3.505. 

• el or e2: lensdt shear estimators. Note that the e2 com¬ 
ponent is dehned relative to the ALPHA_J2000,DELTA_J2000 
co-ordinate grid. Depending on how users dehne their an¬ 
gles in this reference frame, they may find that they need to 
change the sign of e2. 

• TILE_PSF_SYS_DK: The star-galaxy ellipticity correla¬ 
tion pass/fail flag, as described in §5.3, which applies on 
a tile-by-tile basis. Applying this field selection is required 
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Figure Bl. A check plot for photometry. A single sanity check plot is generated for each pointing for inspection. See text for details. 
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PSF Model PSF Residual KIDS_129pO_mlp5 



Figure B2. A check plot for PSF modelling. A single check plot is generated for each pointing for inspection. See text for details. 
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to reduce systematic contamination from the PSF for cosmic 
shear science. It is not, however, required for galaxy-galaxy 
lensing or cluster lensing science. 

• m_cor and m_cor_best: The multiplicative calibration 
correction which should be applied in an ensemble average, 
rather than on a galaxy-by-galaxy basis. m_cor was the cor¬ 
rection used by the early science papers that accompany 
this release which did not include the correct pixel scale (see 
§5.1.1 for discussion). m_cor_best is the calibration correc¬ 
tion used in the cosmic shear analysis in this paper, with the 
correct pixel scale applied. 

• cl and c2: The additive calibration correction used 
in the early science papers that accompany this release. 
These should be subtracted from the lens&t shear estimators 
el and e2. Some regions in the SNR-size-Strehl parameter 
space are too sparsely populated to calculate this empirical 
correction, resulting in a few tens of galaxies raising a flag 
value for cl and c2 of —99. These galaxies should either be 
removed from the analysis, or the correction should not be 
applied. 

• cl_best and c2_best: An improved estimate of the ad¬ 
ditive calibration correction used in the cosmic shear analy¬ 
sis in this paper. These should be used in conjunction with 
the pass/fail field selection TILE_PSF_SYS_OK. See above for 
flagged values. 

• PSF_el and PSF_el_exp [k]: Model PSF ellipticities at 
the location of the object, in this case the real part of epsF- 
PSF_el is the average PSF ellipticity over all sub-exposures 
that included the object, whereas PSF_el_exp [k] refers to 
the single sub-exposure number k. In cases where objects are 
not imaged in the sub-exposure, PSF_el_exp [k] is given a 
value of -99. 
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Table Cl. Columns provided in the KiDS lensing catalogues. 


Label 

Description 

Units 

KIDS_TILE 

Name of survey tile 


THELI_NAME 

Theli name for the tile 


SExtractor^ parameters derived from the Theli r-band co-added image: 


SeqNr 

Running object number within the catalogue 


FLUX_AUT0 

r-band flux 

counts 

FLUXERR.AUTO 

Error on FLUX.AUTO 

counts 

MAG_AUT0 

r-band magnitude 

mag 

MAGERR_AUT0 

Error on MAG.AUTO 

mag 

KR0N_RADIUS 

Scaling radius of the ellipse for magnitude measurements 


BackGr 

Background counts at centroid position 

counts 

Level 

Detection threshold above background 

counts 

MU_THRESHQLD 

Detection threshold above background 

mag arcsec“^ 

MaxVal 

Peak flux above background 

counts 

MU_MAX 

Peak surface brightness above background 

mag arcsec“^ 

IS0AREA_WQRLD 

Isophotal area above analysis threshold 

deg^ 

Xpos 

Centroid x position in the Theli image 

pix 

Ypos 

Centroid y position in the Theli image 

pix 

ALPHA_J2000 

Centroid sky position right ascension (J2000) 

deg 

DELTA_J2000 

Centroid sky position declination (J2000) 

deg 

A_W0RLD 

Profile RMS along major axis 

deg 

B_W0RLD 

Profile RMS along minor axis 

deg 

THETA_J2000“ 

Position angle (West of North)“ 

deg 

ERRA_W0RLD 

World RMS position error along major axis 

deg 

ERRB_W0RLD 

World RMS position error along minor axis 

deg 

ERRTHETA_J2000 

Error on THETA_J2000 

deg 

FWHM_IMAGE 

FWHM assuming a gaussian object profile 

pix 

FWHM_W0RLD 

FWHM assuming a gaussian object profile 

deg 

Flag 

SExtractor extraction flags 


FLUX_RADIUS 

Half-light radius 

pix 

NIMAFLAGS_ISD 

Number of flagged pixels 


CLASS.STAR 

Star-galaxy classifier 


Other parameters derived from the Theli r-band co-added image: 


MAN_MASK 

Final masking flag value, including both manual and automated masks (0 = no mask) 


SG.FLAG 

Star-galaxy separator from 2nd and 4th image moments (0=star, l=galaxy) 


Parameters derived from the Astro-WISE u,-, g- , r- and 2 -band co-added images: 


Agaper 

Major axis of GAaP aperture 

arcsec 

Bgaper 

Minor axis of GAaP aperture 

arcsec 

PAgaap 

Position angle of major axis of GAaP aperture (North of West) 

deg 

MAG_GAAP_[ugri]_CALIB 

GAaP [ugri] magnitude, zero-point and extinction-calibrated 

mag 

MAGERR_GAAP_[ugri] 

Error on MAG_GAAP_[ugri]_CALIB 

mag 

Flag_GAAP_[ugri] 

Flag for MAG_GAAP_ [ugri] _CALIB 


EXTINCTION.[ugri] 

Galactic extinction in the ugri band 

mag 

MAG_LIM_[ugri] 

1-a limiting magnitude in the ugri band 

mag 

Z_B 

BPZ redshift estimate; peak of posterior probability distribution 


Z_B_MIN 

Lower bound of the 95% confidence interval of Z_B 


Z_B_MAX 

Upper bound of the 95% confidence interval of Z_B 


T_B 

Spectral type corresponding to Z_B 


ODDS 

Empirical ODDS of Z_B 


Z_ML 

BPZ maximum likelihood redshift 


T_ML 

Spectral type corresponding to Z_ML 


CHI.SQUARED.BPZ 

value associated with Z_B 


BPZ_FILT 

Filters with photometry used in BPZ; bit-coded mask 


NBPZ_FILT 

Number of filters with good photometry used in BPZ 


BPZ.NONDETFILT 

Filters with faint photometry (not used in BPz); bit-coded mask 


NBPZ.NONDETFILT 

Number of filters with faint photometry 


BPZ.FLAGFILT 

Filters with flagged photometry (not used in BPz); bit-coded mask 


NBPZ.FLAGFILT 

Number of filters with flagged photometry 


PZ_full 

Vector containing the posterior photo- 2 : probability 



Continued on next page 
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Label 


Description 


Units 


Parameters derived with lensfit on the Theli r-band sub-exposures: 


weight 
fitclass 
scalelength 
bulge_fraction 
model_flux 
SNratio 
PSF_el 
PSF_e2 

PSF_Strehl_ratio 

catmag 

n_exposures_used 

PSF_el_expl 

PSF_e2_expl 


inverse variance weight 

fit class (class=0 means a galaxy, no issue) 
galaxy model scale length 
galaxy model bulge fraction 
galaxy model flux 
SNR for model fit 

mean ellipticity of PSF, component 1 
mean ellipticity of PSF, component 2 

Pseudo-Strehl ratio of PSF (flux fraction in central pixel) 
r-band magnitude used to calculate the size prior 
Number of sub-exposures used 

PSF model ellipticity component 1, on sub-exposure 1 
PSF model ellipticity component 2, on sub-exposure 1 


pix 

counts 


PSF_el_exp5 

PSF_e2_exp5 

el 

e2 

cl 

c2 

in_cor 

cl_best 

c2_best 

in_cor_best 


PSF model ellipticity component 1, on sub-exposure 5 
PSF model ellipticity component 2, on sub-exposure 5 
Galaxy ellipticity ei (no c or m correction) 

Galaxy ellipticity €2 (no c or m correction) 

Additive bias of ei based on SNR, scale length and Strehl 

Additive bias of 62 based on SNR, scale length and Strehl 

Multiplicative bias of ei and 62 based on SNR, scale length and Strehl 

Updated additive bias of ei based on SNR, scale length and Strehl 

Updated additive bias of 62 based on SNR, scale length and Strehl 

Updated multiplicative bias of ei and 62 based on SNR, scale length and Strehl 


Parameters derived from systematics tests on the catalogue: 

TILE_SYS_OK Tile pass/fail flag based on star-galaxy ellipticity correlation (pass=l, fail=0) 

“ These catalogues were created with SExtractor version 2.2.2. Note that from version 2.4.6 onwards the definition of THETA_J2000 

was changed from West-of-North to East-of-North, i.e., the sign flipped. 
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